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Description 

This invention relates to Bacillus strains. We describe below such strains useful for the expression and secretion 
of desired polypeptides (as used herein, 'polypeptide' means any useful chain of amino acids, including proteins). 
5 Bacillus strains have been used as hosts to express heterologous polypeptides from genetically engineered vec- 

tors. The use of a Gram positive host such as Bacillus avoids some of the problems associated with expressing het- 
erologous genes in Gram negative organisms such as.E. coli. For example, Gram negative organisms produce endo- 
toxins which may be difficult to separate from a desired product. Furthermore, Gram negative organisms such as 
coli are not easily adapted for the secretion of foreign products, and the recovery of products sequestered within the 

10 cells is time-consuming, tedious, and potentially problematic. In addition, Bacillus strains are non-pathogenic and are 
capable of secreting proteins by well-characterized mechanisms. 

A general problem in using Bacillus host strains in expression systems is that they produce large amounts of 
proteases which can degrade heterologous polypeptides before they can be recovered from the culture media. The 
proteases which are responsible for the majority of this proteolytic activity are produced at the end of the exponential 

'5 phase of growth, under conditions of nutrient deprivation, as the cells prepare for sporulation. The two major extracel- 
lular proteases an alkaline serine protease (subtilisin), the product of the apr gene, and a neutral metal lop rotease, the 
product of the np_r gene, are secreted into the medium, whereas the major intracellular serine protease, lsp-1 , is pro- 
duced within the cells. Other investigators have created genetically altered Bacillus strains that produce below-normal 
levels of one or more of these three proteases, but these strains still produce high enough levels of protease to cause 

20 the degradation of heterologous gene products prior to purification. 

Stahl et al. (J. Bact., 1984, Jl58:411) disclose a Bacillus protease mutant in which the chromosomal subtilisin 
structural gene was replaced with an in vitro derived deletion mutation. Strains carrying this mutation produced only 
10% of the wild-type extracellular serine protease activity. Yang et al. (J. Bact., 1984, j[60:15) disclose a Bacillus 
protease mutant in which the chromosomal neutral protease gene was replaced with a gene having an in vitro derived 

2S deletion mutation. Fahnestock et al. (WO B6/01825) describe Bacillus strains lacking subtilisin activity which were 
constructed by replacing the native chromosomal gene sequence with a partially homologous DNA sequence having 
an inactivating segment inserted into it. Kawamura et aL (J. Bact., 19B4, 160 :442) disclose Bacillus strains carrying 
lesions in the np_r and apr genes and expressing less than 4% of the wild-type level of extracellular protease activity. 
Koide etal. (J. Bact., 1986, 167 :110) disclose the cloning and sequencing of the isp-1 gene and the construction of an 

30 lsp-1 negative mutant by chromosomal integration of an artificially deleted gene. 

Genetically altered strains which are deleted for the extracellular protease genes (apr and np_r) produce significantly 
lower levels of protease activity than do wild-type Bacillus strains. These bacteria, when grown on medium containing 
a protease substrate, exhibit little or no proteolytic activity, as measured by the lack of appearance of a zone of clearing 
(halo) around the colonies. Some heterologous polypeptides and proteins produced from these double mutants are, 

35 nevertheless, substantially degraded prior to purification, although they are more stable than when produced in a wild- 
type strain of Bacillus . 

The invention provides improved Bacillus cells containing mutations in one or more of three previously uncharac- 
terized protease genes; the cells also preferably contain mutations in the ap_r and np_r genes that encode the major 
extracellular proteases, resulting in the inhibition by the cells of production of these extracellular proteases. The mu- 

*o tations of the invention include a mutation in the ep_r gene which inhibits the production by the cell of the proteolytically 
active egr gene product, and/or a mutation in the gene (herein, the "RP-l" gene) encoding residual protease I (RP-I) 
which inhibits the production by the cell of proteolytically active RP-I, and/or a mutation in the gene (herein, the "RP- 
II" gene) encoding residual protease II (RP-II). The. proteases encoded by the ep_r gene and RP-II genes are novel 
proteins. Most preferably, the mutations are deletions within the coding region of the genes, including deletion of the 

*s entire coding region; alternatively, a mutation can consist of a substitution of one or more base pairs for naturally 
occuring base pairs, or an insertion within the rotease coding region. 

Bacillus cells in accordance with the invention may additionally contain a mutation in the jsg-1 gene encoding 
intracellular serine protease I and may in addition contain a mutation which blocks sporulation and thus reduces the 
cell's capacity to produce sporulation<lependent proteases; preferably, this mutation blocks sporulation at an early 

50 stage but does not eliminate the cell's ability to be transformed by purified DNA; most preferably, this mutation is the 
spo OA mutation (described below). 

The invention provides, in an alternative aspect thereof, a method for producing stable heterologous polypeptides 
in a Bacillus host cell by modifying the host to contain mutations in the apr and np_r genes and in one or more of the 
genes including the epr gene, the RP-I gene, and the RP-II gene. 

55 The invention also features, in respective further aspects thereof, purified DNA, expression vectors containing 

DNA, and host Bacillus cells transformed with DNA, in each case encoding one of the proteases RP-I, RP-II, or the 
product of the ep_r gene; preferably, such DNA is derived from Bacillus subtilis . 

The invention also features, in yet further aspects thereof, the isolation of substantially pure Epr, residual protease 
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I (RP-I), and another previously uncharacterised protease called residual protease II (RPN), and characterisation of 
the RP-I and RP-II proteases; as used herein, 'substantially pure" means greater than 90% pure by weight. 

The terms "epr gene", "RP-I gene", and "RP-II gene" herein mean the respective genes corresponding to these 
designations in Bacillus subtilis , and the evolutionary homologues of those genes in other Bacillus species, which 
s homologues, as is the case for other Bacillus proteins, can be expected to vary in minor respects from species to 
species. The RP-I and RP-II genes of B. subtilis are also designated, respectively, the bpr and mpr genes. In many 
cases, sequence homology between evolutionary homologues is great enough so that a gene derived from one species 
can be used as a hybridization probe to obtain the evolutionary homologue from another species, using standard 
techniques. In addition, of course, those terms also include genes in which base changes have been made which, 
w because of the redundancy of the genetic code, do not change the encoded amino acid residue. 

Using the procedures described herein, we have produced Bacillus strains which are significantly reduced in their 
ability to produce proteases, and are therefore useful as hosts for the expression, without significant degradation, of 
heterologous polypeptides capable of being secreted into the culture medium. We have found that our Bacillus cells, 
even though containing several mutations in genes encoding related activities, are not only viable but healthy. 
*5 Any desired polypeptide can be expressed using our techniques, e.g., medically useful proteins such as hormones, 

vaccines, antiviral proteins, antitumor proteins, antibodies or clotting proteins; and agriculturally and industrially useful 
proteins such as enzymes or pesticides, and any other polypeptide that is unstable in Bacillus hosts that contain one 
or more of the proteases inhibited in our cells. 

Other features and advantages of the invention will be apparent from the following description of preferred em- 
20 bodiments thereof. 

The drawings will first be briefly described. 

Fig. 1 is a series of diagrammatic representations of the plasmids p371 and p371A, which contain a 2.4 kb Hindlll 
insert encoding the Bacillus subtilis neutral protease gene and the same insert with a deletion in the neutral protease 
gene, respectively, and p371ACM, which contains the Bacillus cat gene. 
25 Fig. 2 is a Southern blot of Hindlll digested IS75 and IS75NA DNA probed with a 32 P-labeled oligonucleotide 

corresponding to part of the nucleotide sequence of the np_r gene. 

Fig. 3 is a representation of the 6.5 kb insert of plasmid pAS007, which encodes the Bacillus subtilis subtilisin 
gene, and the construction of the deletion plasmid pAS13. 

Fig. 4 is a representation of the plasmid plSP-1 containing a 2.7 kb Bam HI insert which encodes the intracellular 
30 serine protease ISP-1, and the construction of the ISP-1 deletion plasmid pAL6. 

Fig. 5 is a diagrammatic representation of the cloned epr gene, showing restriction enzyme recognition sites. 

Fig. 6 is the DNA sequence of the egr gene. 

Fig. 7 is a diagrammatic representation of the construction of the plasmid pNP9, which contains the deleted epr 
gene and the Bacillus cat gene. 
35 Fig. 8 is the amino acid sequence of the first 28 residues of Rp-I and the corresponding DNA sequence of the 

probe used to clone the RP-I gene. 

Fig. 9 is a restriction map of the 6.5kb insert of plasmid pCR83, which encodes the RP-I protein. 

Fig. 10 is the DNA sequence of DNA encoding RP-l protease. 

Fig. 11 is the amino acid sequence of three internal RP-II fragments (a, b, c), and the nucleotide sequence of three 
40 guess-mers used to clone the gene (a), (b) and (c). 

Fig. 12 is a Southern blot of GP241 chromosomal DNA probed with BRT90 and 707. 

Fig. 1 3 is a diagram of (a) a restriction map of the 3.6 kb Pstl insert of pLPI, (b) the construction of the deleted RP- 
il gene and <c) the plasmid used to create an RP-II deletion in the Bacillus chromosome. 
Fig. 14 is the DNA sequence of DNA encoding RP-II. 

45 

General Strategy for Creating Protease Deleted Bacillus Strains 

The general strategy we followed for creating a Bacillus strain which is substantially devoid of proteolytic activity 
is outlined below. 

50 A deletion mutant of the two known major extracellular protease genes, apr and npr , was constructed first. The 

isp - 1 gene encoding the major intracellular protease was then deleted to create a triple protease deletion mutant. The 
spoO A mutation was introduced into either the double or triple deletion mutants to significantly reduce any sporulation 
dependent protease activity present in the cell. A gene encoding a previously unknown protease was then isolated 
and its entire nucleotide sequence was determined The gene, egr, encodes a primary product of 645 amino acids that 

55 is partially homologous to both subtilisin (Apr) and the major internal serine protease (lsp-1) of B. subtilis . A deletion 
of this gene was created in vitro and introduced into the triple protease deleted host. A deletion in a newly identified 
gene encoding residual protease RP-I was then introduced to create a strain of B. subtilis having substantially reduced 
protease activity and expressing only the RP-II activity. RP-II has been purified and a portion of the amino acid sequence 
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was determined for use in creating the nucleic acid probes which were used to clone the gene encoding this protease. 
Upon cloning the gene, it was possible to create a Bacillus strain which contains a deletion in the RP-II gene and is 
thus incapable of producing RP-II. 

Detailed procedures for construction of the protease gene deletions and preparation of Bacillus strains exhibiting 
5 reduced protease activity are described below. 

General Methods 

Our methods for the construction of a multiply deleted Bacillus strain are described below. Isolation of B. subtilis 
10 chromosomal DNA was as described by Dubnau et al., (1971, J. Mol Biol., 56: 209). B. subtilis strains were grown on 
tryptose blood agar base (Difco Laboratories) or minimal glucose medium and were made competent by the procedure 
of Anagnostopoulos et al., (J. Bact., 1961 ,81; 741). .E. coli JM107 was grown and made competent by the procedure 
of Hanahan (J. Mol. Biol., 1983. 166: 587). Plasmid DNA from B. subtilis and E. coli were prepared by the lysis method 
of Birnboim et al. (Nucl. Acid. Res., 1979, 7: 1513). Plasmid DNA transformation in B subtilis was performed as de- 
'5 scribed by Gryczan et al., (J. Bact, 1978, 134 : 138). 

Protease assays 

Two different protease substrates, azocoll and casein (Labelled either with 14 C or the chromophore resorufin), 
20 were used for protease assays, with the casein substrate being more sensitive to proteolytic activity. Culture superna- 
tant samples were assayed either 2 or 20 hours into stationary phase. Azocoll-based protease assays were performed 
by adding 100 ul of culture supernatant to 900 ul of 50 mM Tris, pH 8, 5 mM CaCl 2 , and 10 mg of azocoll (Sigma), a 
covalently modified, insoluble form of the protein collagen which releases a soluble chromophore when proteolytically 
cleaved. The solutions were incubated at 37° C for 30 minutes with constant shaking. The reactions were then centri- 
25 fuged to remove the insoluble azocoll and the A^ of the solution determined. Inhibitors were pre-incubated with the 
reaction mix for 5 minutes at 37°C. Where a very small amount of residual protease activity was to be measured, 14 C- 
casein or resorufin-labelled casein was used as the substrate. In the 14 C-casein test, culture supernatant (100 ul) was 
added to 100 ul of 50 mM Tris, 5mM CaCl 2 containing 1 Xl0 5 cpmof 14 C-casein (New England Nuclear). The solutions 
were incubated at 37°C for 30 minutes. The reactions were then placed on ice and 20 ug of BSA were added as carrier 
30 protein. Cold 1 0% TCA (600 ul) was added and the mix was kept on ice for 1 0 minutes. The solutions were centrifuged 
to spin out the precipitated protein and the supematants counted in a scintillation counter. The resorufin-labelled casein 
assay involved incubation of culture supernatant with an equal volume of resorufin labelled casein in Tris=CI buffer, 
pH 8. 0, at 37° C for various times. Following incubation, unhydrolyzed substrate was precipitated with TCA and the 
resulting chromogenic supernatant was quantitated spectrophotometry ly. 

35 

Deletion of the npr gene 

According to Yang et al. (J. Bact., 1984, 160 : 15), the npr gene is contained within overlapping EcoRI and Hindi 1 1 
restriction fragments of B. subtilis DNA, and a majority of the gene sequence is located on the 2.4 kb Hindi! I fragment. 

40 This fragment was chosen for creation of the npr deletion. 

An individual clone containing the 2.4 kb Hin dlll fragment was isolated from a clone bank of genomic Hindi 1 1 frag- 
ments prepared as follows. Chromosomal DNA was isolated from B. subtilis strain IS75, digested with Hindlll and size 
fractionated by electrophoresis on a 0.8% agarose gel. DNA in the 2-4 kb size range was electroeluted from the gel. 
The purified DNA was ligated to Hindlll digested and alkaline phosphatase treated pUC9 DNA (an .E. coli replicon 

45 commercially available from Bethesda Research Labs, Rockville, Md), transformed into competent cells of E. coli strain 
JM107, and plated on LB + 50 ug/ml ampicillin resulting in 1000 Amp R colonies. 

Colonies containing the cloned neutral protease gene fragment were identified by standard colony hybridization 
methods (Maniatis et al., 1983, "Molecular Cloning, A Laboratory Manual", Cold Spring Harbor, New York). Briefly, 
transformants are transferred to nitrocellulose filters, lysed to release the nucleic acids and probed with an npr specific 

so probe. A 20 base oligonucleotide complementary to the npr gene sequence between nucleotides 520 and 540 (Yang 
et al., supra ) was used as the probe. The sequence is 5'GGCACGCTTGTCTCAAGCAC 3'. A representative clone 
containing the 2.4 kb Hindlll insert was identified and named p371 (Fig. 1). 

A deleted form of the ngr gene in p371 was derived in vitro . A 580 bp internal Rsa l fragment was deleted by 
digesting p371 DNA with Rsa l and Hindlll. The 600 bp Hindi II -Rsa l fragment spanning the 5' end of the gene and the 

55 1220 bp Rsa l-HindlH fragment spanning the 3* end of the gene (see Fig. 1) were isolated and cloned into Hindlll and 
alkaline phosphatase treated pUC9. This resulted in the deletion of the center portion of the np_r gene. The ligated DNA 
was transformed into E. colj JM1 07. A clone having the desired deletion within the np_r gene was identified by restriction 
enzyme analysis. This plasmid is designated p371A. 
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A gene encoding a selectable marker was included on the vector to facilitate the selection of integrants in Bacillus , 
The Bacillus cat gene, encoding resistance to chloramphenicol (CnY), was isolated from plasmid pMM101 (Youngman 
et al. , 1 984, Plasmid J2: 1 -9) on a 1 .3 kb Sail fragment and cloned into the Sail site of p371 A. This DN A was transformed 
into E. coli JM107 and transformants were screened for chloramphenicol resistance. A representative plasmid con- 

5 taining both the deleted npj; gene and the cat gene was named p371 ACm (Fig. 1 ). 

The vector p371ACm was derived from the E. coli replicon pUCI9 and is therefore unable to replicate in a Bacillus 
host. The wild-type npr gene in the chromosome of the recipient host was exchanged for the deleted np_r gene contained 
on the vector by reciprocal recombination between homologous sequences. The Cm r marker gene enabled the selec- 
tion of cells into which the vector, inclusive of the protease gene sequence, had integrated. 

10 Vector sequences that integrated with the deleted njgr gene were spontaneously resolved from the chromosome 

at a low frequency, taking a copy of the n£r gene along with them. Retention of the deleted protease gene in the 
chromosome was then confirmed by assaying for the lack of protease activity in the CmF segregants. 

Specifically, competent B. subtilis IS75 cells were transformed with p371 ACm and selected for Cm r . Approximately 
2000 colonies, which had presumably integrated the deleted npr gene adjacent to, or in place of, the wild type gene, 

is were selected which were resistant to chloramphenicol. Approximately 25% of the colonies formed smaller zones of 
clearing on starch agar indicating that the wild-type gene had been replaced with the deleted form of the gene. No 
neutral protease activity was detected in supernatants from these cell cultures. In contrast, high levels of neutral pro- 
tease activity were assayed in culture fluids from wild type IS75 cells. Segregants which contained a single integrated 
copy of the deleted protease genes, but which had eliminated the vector sequences were then selected as follows. 

20 A culture of Cm r colonies was grown overnight in liquid media without selection then plated onto TBAB media. 

These colonies were then replicated onto media containing chloramphenicol and those that did not grow in the presence 
of chloramphenicol were identified and selected from the original plate. One such Npr negative colony was selected 
and designated IS75NA. 

Deletion within the n£r gene in IS75NA was confirmed by standard Southern blot analysis (Southern, 1977, J. Mol. 
25 Biol. 98: 503) of Hindi II digested DNA isolated from B. subtilis IS75N and IS75NA probed with the 32 P-labelled npr - 
specific oligonucleotide. The probe hybridized with a 2.4 kb Hindlll fragment in wild-type IS75N DNA and with a 1.8 
kb fragment in IS75N A DNA indicating that 600 bp of the np_r gene were deleted in IS75NA (see Fig. 2). 

Deletion of the apr gene 

30 

To clone the subtilisin gene (apr ) a genomic library from IS75 DNA was first prepared. Chromosomal DNA was 
isolated and digested with Eco RI and separated by electrophoresis through a 0.8% agarose gel. Fragments in the 5-8 
kb size range were purified by electroelution from the gel. The fragments were ligated with Eco RI digested pBR328 
DNA (publicly available from New England BioLabs) and transformed into competent E. coli JM1 07 cells. Transformants 

35 were screened for plasmids containing apj gene inserts by hybridizing with a synthetic ^P-labelled 17-mer oligonu- 
cleotide probe which was complementary to the agr gene sequence between nucleotides 503 and 520 (Stahl et al., 
1984, J. Bact. 158 : 411). A clone with a 6.5 kb Eco RI insert that hybridized with the probe was selected and named 
pAS007 (Fig. 3). The 6.5 kb fragment contained the entire coding sequence of the subtilisin gene. 

A mutant of the apr gene was created by deleting the two internal Hpal fragments (Fig. 3). pAS007 was first digested 

40 with Hpa l and then recircularized by ligating in a dilute solution (5ug/ml) to eliminate the two Hpa l fragments Approx- 
imately 200 Amp r colonies arose following transformation of JM107 cells. One of these transformants contained a 4.8 
kb Eco RI insert with one internal Hpa l site. It was designated pAS12. The deletion in the ap_r gene extended 500 bp 
beyond the 3' end of the gene, however this DNA apparently did not contain any genes that were essential toB. subtilis . 
A 1.3 kb Sail fragment containing the Bacillus cat gene was cloned into the Sail site of pAS12 (described above) 

45 for selection of integrants in the Bacillus host chromosome. The plasmid DNA was transformed into E. coh JM107, 
plated on media containing ampicillin and approximately 50 Amp r colonies were recovered and replica plated onto 
media containing 7.5 ug/ml chloramphenicol. Three of the 50 colonies were Cm r . Plasmid DNA was isolated from these 
three clones and analyzed by restriction digestion. One of the plasmids had the desired restriction pattern and was 
named pAS13 (Fig. 3). 

50 To promote integration of the deleted protease gene into the B. subtilis chromosome, pASI 3 was introduced into 

strain IS75NA and selected for Cm r transformants. The transformants were then screened for replacement of the wild- 
type a£r gene with the deleted gene by plating on TBAB plates containing 5 ug/ml Cm and 1.5% casein. Several of 
the colonies which did not produce halos were selected for loss of the Cm r gene as described above. A representative 
transformant was chosen and designated GP199. 

55 Protease activity was assayed in the culture fluids from the double protease deleted strain, as well as in the strain 

having only the deleted neutral protease gene. Protease activity in Npr, Apr mutant cells was approximately 4-7% 
of wild type levels whereas the Npr mutant exhibited higher levels of protease activity. 
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amyE Mutation 

Protease deficient strains were tested in connection with the production of a Bacillus amylase. To assay the levels 
of amylase produced by various plasmid constructs it was necessary to introduce a mutant amylase gene into the host 
in place of the wild type gene. This step is not essential to the present invention and does not affect the level of protease 
activity; it was performed only because plasmid encoded amylase levels could not be determined in the presence of 
the chromosomally encoded amylase. The amy E allele was transformed from B. subtilis strain JF206 (trpC2, amy E) 
into GP1 99 by a transformation/selection process known as congression. This process relies on the ability of competent 
B. subtilis cells to be transformed by more than one piece of chromosomal DNA when the transforming DNA is provided 
in excess. The process involves initial selection of competent cells in the population by assaying for expression of a 
selectable marker gene which subsequently facilitates screening for co transfer of an unselectable marker, such as 
inability to produce amylase. 

Total chromosomal DNA was isolated from JF206 or a similar strain containing an amy E mutation. Saturating 
concentrations (—lug) were transformed into competent GP1 99 (met, leu - , his - ) and His + transformants were selected 
on minimal media supplemented with methionine and leucine. The transformants were screened for an amylase minus 
phenotype on plates having a layer of top agar containing starch-azure. Five percent of the His + colonies were unable 
to produce halos indicating that the amylase gene was defective. One such transformant was assayed for the protease- 
deficient phenotype and was designated GP200. 

Supernatant samples from cultures of the double protease mutant were assayed for protease activity using azocoll 
as the substrate. When assayed on this substrate, protease activity in the double protease mutant strain was 4% of 
wild type levels. When the more sensitive substrate 14 C-casein was used in the protease assay, the double mutant 
displayed 5-7% of the wild type B. subtilis activity. Although protease activity in this strain was low, we discovered that 
certain heterologous gene products produced by these protease deficient cells were not stable, indicating the presence 
of residual protease activity. We then sought to identify and mutate the gene(s) responsible for the residual protease 



In order to characterize the residual protease activity, a number of known protease inhibitors were tested for their 
ability to reduce protease levels in cultures of the double protease mutant strain. PMSF (phenylmethylsulfonyl flouride), 
a known inhibitor of serine protease activity, was found to be the most effective. The addition of PMSF to growing 
cultures of Apr Npr Bacillus cells successfully increased the stability of heterologous peptides and proteins synthe- 
sized in and secreted from these cells. These results indicated that at least a portion of the residual degradative activity 
was due to a serine protease. 

Subtilisin is the major serine protease to be secreted by B. subtilis ; however, the serine protease encoded by the 
isp-1 gene (ISP-1) has been shown to accumulate intracellular^ during sporulation (Srivastava et al., 1981, Arch. 
Microbiol., 1_29: 227). In order to find out if the residual protease activity was due to lsp-1 , a deleted version of the isp - 
1 gene was created in vitro and incorporated into the double-protease deleted strain. 

Deletion of the isp-1 gene 

The jsp_-1 gene is contained within a 2.7 kb Bam HI fragment of B. subtilis chromosomal DNA (Koide et aL, 19B6, 
J. Bact., 167:110). Purified DNA was digested with Bam HI and fragments in the 2,7 kb size range were electroeluted 
from an agarose gel, ligated into Bam HI digested pBR328 and transformed into E. colt JM 107 cells. One Amp r colony 
that produced a halo on LB media containing 1% casein was selected and named plSP-1. Restriction analysis of the 
DNA indicated that plSP-1 carried a 2.7 kb Bam HI insert which hybridized with a synthetic 25 base 32 P-labeled oligo- 
nucleotide probe [5'ATGAATGGTGAAATCCGCTTGATCC 3*] complementary to the jsg-1 gene sequence (Koide et al, 
supra ). The restriction pattern generated by Sajl and Eco RI digestions confirmed the presence of the jsg-1 gene in 
plSP-1. 

A deletion was created within the jsg-1 gene by taking advantage of a unique Sail site located in the center of the 
gene. Because there was an additional Sail site in the vector, the 2.7 kb Bam HI gene insert was first cloned into the 
Bam HI site of a derivative of pBR322 (pAL4) from which the Sail site had been eliminated (Fig. 4). The resulting plasmid, 
pAL5, therefore had a unique Sail site within the jsp_-1 gene pAL5 DNA was digested with Sajl, treated with Bal31 
exonuclease for five minutes at 37°C to delete a portion of the gene sequence, and religated. The DNA was transformed 
into JM107 and resulting Amp r colonies were screened for a Bam HI insert of reduced size. A plasmid with a 1 .2 kb 
deletion within the Bam HI insert was selected and named pAL6 (Fig. 4). 

The cat gene was purified from the E. coli plasmid pMI1101 on a Sail fragment as above and cloned into pAL6 at 
the Eco RV site. The resulting DNA was transformed into the double protease mutant strain (GP200) and integrants 
containing the deleted ISP-1 gene were selected as described above. The triple-protease deleted strain is called GP208 
(ap_rA, nprA, jsp_-1 A). Using a casein substrate, protease activity was measured in the triple-mutant strain (Apr, Npr, 
lsp-1-) and found to be 4% of the wild type level, about the same as the double mutant strain. 



25 
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The remaining 4% residual protease activity was apparently due either to a previously described esterase called 
bacillopeptidase F (Roitsch et al., 1 983, J Bact., 155 : 1 45), or to previously unknown and unidentified protease gene(s). 

Introduction of a sporulation mutation 

5 

Because it had been shown that the production of certain proteases was associated with the process of sporulation 
in B. subtilis , we reasoned that it may be useful to include a mutation which blocked sporulation in our protease deficient 
hosts and thus further reduce sporulation-dependent protease production in these strains. Mutations that block the 
sporulation process at stage 0 reduce the level of protease produced, but do not eliminate the ability of the cells to be 

io transformed by purified DNA. spoOA mutations have been shown to be particularly efficient at decreasing protease 
synthesis (Ferrari et al., 1986. J. Bact. 166 :173). 

We first introduced the spo OA mutation into the double protease deficient strain as one aspect of our strategy to 
eliminate the production of the serine protease, lsp-1 . We ultimately introduced the spoOA mutation into the triple- and 
quadruple- protease deficient strains. This feature is useful only when a promoter, contained within an expression 

is vector for the production of heterologous gene products in a Bacillus host, is not a sporulation-specific promoter (e.g. 
the spoVG promoter). 

Saturating amounts of chromosomal DNA were prepared from B. subtilis strain JH646 (spo OA, Prot + , Amy + , Met + ) 
or similar strains having a spo OA mutation, and transformed into competent GP200 cells (Spo*. Prot - , Amy - , Met"). 
Met+ transformants were selected by growth on minima) media plates. Resulting transformants were then screened 

2° for co-transformation of the spo OA allele by assaying on sporulation medium (Difco) for the sporulation deficiency 
phenotype, characterised by smooth colony morphology and the lack of production of a brown pigment. Approximately 
9% of the Met+ transformants appeared to be co-transformed with the spo OA allele; a number of these were rescreened 
on plates containing either starch-azure or casein to confirm that the recipients had not also been co-transformed with 
intact amylase or protease genes from the donor DNA. One transformant that did not exhibit detectable protease activity 

25 was designated GP205 (spoOA, amy E, aprA, nprEV Protease levels produced by this host were 0.1% of the level 
found in the extracellular fluid of the Spo + host, when casein was the substrate. 

In the same manner, the spo OA mutation was introduced into the triple protease deficient mutant GP208 (aprA, 
nprA, isp-1 A) and the quadruple protease deficient mutant GP216 (apr A, nprA, isp-1 A, eprA and described below). The 
resulting Spo- strains are GP210 and GP235, respectively. These strains are useful when the expression vector is not 

30 based on a sporulation dependent promoter. 

Identification of a new protease gene 

We expected that the isolation and cloning of the gene(s) responsible for the remaining protease activity would be 
35 difficult using conventional methods because cells did not produce large enough amounts of the enzyme(s) to detect 
by the appearance of halos on casein plates. We reasoned that it should be possible to isolate the gene(s) if it were 
replicated on a high-copy vector so that the copy number of the gene(s), and thus protease production, would be 
amplified to detectable levels. This strategy enabled us to isolate a novel protease gene from a Bacillus gene bank. 
The first of these new protease genes has been named ep_r (extracellular protease). Deletion mutants of this new gene 
40 were derived in vitro and introduced into the Apr Npr Isp- Bacillus host strains by gene replacement methods as 
described above. 

Cloning the epr gene 

45 In order to obtain a clone carrying a gene responsible for residual protease activity, a Sau 3A library of B^ subtilis 

GP208 DNA was prepared. Chromosomal DNA was isolated, subjected to partial digestion with Sau 3A and size-frac- 
tionated on an agarose gel. Fragments in the 3-7 kb size range were eluted from the gel and cloned into the BgJII site 
of pEc224, a shuttle vector capable of replicating in both E. coli and Bacillus (derived by ligating the large EcoRI-Pvull 
fragment of pBR322 with the large EcoRI-Pvull fragment of pBD64 (Gryczan et al., 1 978, PNAS 75:1428)). The ligated 

so DNA was transformed into cojj JM107 and plated on media containing casein. None of the 1200.E. colj colonies 
produced halos on casein plates, however by restriction analysis of the purified plasmid DNA, approximately 90% of 
the clones contained inserts with an average size of about 4 kb. The clones were transformed into a Bacillus host to 
screen for protease activity as follows. _E. coli transformants were pooled in twelve groups of 100 colonies each 
(G1-G12). The pooled colonies were grown in liquid media (LB + 50 ug/ml ampicillin), plasmid DNA was isolated, 

55 transformed into B. subtilis GP208 (aprA, nprA, isp-1 A) and plated on casein plates. Halos were observed around 
approximately 5% of transformants from pool G11. Plasmid DNA was isolated from each of the positive colonies and 
mapped by restriction enzyme digestion. All of the transformants contained an identical insert of approximately 4 kb 
(Fig. 5). One of these plasmids was selected and named pNPI . 
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Characterization of epr protease activity 

The residual protease activity remaining in GP20B faprA, nprA, isp-1A) cultures accounted for only a small per- 
centage of the total protease activity produced by the host. In order to characterize the type of protease encoded by 
the ep_r gene, the effect of different inhibitors on the protease secreted by B. subtilis GP208/pNP1 was examined. 

Culture media was obtained two hours into stationary phase and assayed using 14 C-casein as the substrate. The 
level of protease activity present in GP208 was not high enough to detect in the standard protease assay described 
above, however, appreciable protease activity was detected in the culture medium of GP208/pNP1, carrying the am- 
plified ep_r gene. The ep_r protease activity was inhibited in the presence of both 10 mM EDTA and 1mM PMSF sug- 
gesting that it encodes a serine protease which requires the presence of a cation for activity, (lsp-1, another serine 
protease, is also inhibited by EDTA and PMSF.) 

Subcloning the epr gene 

A 2.7 kb Hpa l-Sall sub-fragment was isolated from the pNP1 insert and cloned into pBs81/6, a derivative of pBD64 
(derived by changing the Pvull site to a Hindlll site using synthetic linkers). Transformants carrying this subcloned 
fragment were capable of producing halos on casein plates, indicating that the entire protease gene was present within 
this fragment. A representative clone was named pNP3. 

The location of the gene within the pNP3 insert was further defined by subcloning a 1 .6 kb Eco RV subfragment 
into pBs81/6 and selecting for the colonies producing halos on casein plates. A clone which produced a halo, and 
which also contained the 1.6 kb insert shown in Fig. 5, was designated pNP5. The presence of the protease gene 
within this fragment was confirmed by deleting this portion of the 4 kb insert from pNP1 . pNP1 was digested with Eco RV 
and religated under conditions which favored recircularization of the vector without incorporation of the 1 .6 kb Eco RV 
insert. The DNA was transformed into GP208 and colonies were screened on casein plates. Greater than 95% of the 
transformants did not produce halos, indicating that the protease gene had been deleted from these clones. A repre- 
sentative clone was selected and is designated pNP6. (The small percentage of colonies that produced halos were 
presumed to have vectors carrying the native ep_r gene resulting from recombination between the chromosomal copy 
of the gene and homologous sequences within the plasmid.) 

Nucleotide and deduced amino acid sequence of the epr gene 

Subcloning and deletion experiments established that most of the protease gene was contained on the 1.6 kb 
Eco RV fragment (Fig. 5). Determination of the nucleotide sequence of the 1.6 kb Eco RV fragment (Fig. 6) revealed 
an open reading frame which covered almost the entire fragment starting 450 bp from the left end and proceeding 
through the right end (see Fig. 2). Comparison of the deduced amino acid sequence with other amino acid sequences 
in GENBANK indicated that the protein encoded by the ORF had strong homology (approximately 40%) to both subtilisin 
(Stahl et al., 1984, J. Bact., 158:411) and lsp-1 (Koide et al. ( 1986, J. Bact., 167:110) from B. subtilis 168. The most 
probable initiation codon for this protease gene is the ATG at position 1 in Figure 6. This ATG (second codon in the 
ORF) is preceded by an excellent consensus B. subtilis ribosome binding site (AAAGGAGATGA ). In addition, the first 
26 amino acids following this methionine resemble a typical B. subtilis signal sequence: a short sequence containing 
two positively-charged amino acids, followed by 15 hydrophobic amino acids, a helix-breaking proline, and a typical 
Ala X Ala signal peptidase cleavage site (Perlman et al., 1983, J. Mol. Biol., 167 :391 ). 

Sequence analysis indicated that the ORF continued past the end of the downstream Eco RV site, even though 
the 1.6 kb EcoRV fragment was sufficient to encode Epr protease activity. To map the 3 / end of the gene, the DNA 
sequence of the overlapping Kpn l to Sail fragment was determined (Fig. 6). As shown in Figure 2, the end of the ORF 
was found 717 bp downstream of the Eco RV site and the entire ep_r gene was found to encode a 645 amino acid 
protein, the first approximately 380 amino acids of which are homologous to subtilisin (Fig. 6). The C-terminal approx- 
imately 240 amino acids are apparently not essential for proteolytic activity since N-terminal 405 amino acids encoded 
in the 1 .6 kb Eco RV fragment are sufficient for protease activity. 

Structure of the epr protein 

In vitro transcription-translation experiments were used to confirm the size of the protein. Plasmid pNP3 DNA 
(containing the 2.7 kb Hpa l-Sall fragment with the entire epj[ gene) was added to an S30-coupled transcription/trans- 
lation system (New England Nuclear) resulting in the synthesis of a protein of approximately 75,000 daltons. (Additional 
proteins of 60,000 and 34,000 daltons were also observed and presumably represented processed or degraded forms 
of the 75,000 dalton protein.) This size agreed reasonably well with the predicted molecular weight of 69,702 daltons 
for the primary product based on the deduced amino acid sequence. 
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The homology between the amino-terminal half of the epj protease and subtilisin suggests that Epr might also be 
produced as a preproenzyme with a pro sequence of similar size to that of subtilisin (70-80 amino acids). If true, and 
if there were no additional processing, this would argue that the mature Epr enzyme has a molecular weight of around 
58,000. Examination of culture supernatants, however, indicated that the protein has a molecular weight of about 

5 34,000. Comparison by SDS-PAGE of the proteins secreted by B. subtilis strain GP208 containing a plasmid with the 
epr gene (pNP3 or pNP5) or just the parent plasmid alone (pBs81/6) showed that the 2.7 kb Hpa l-Sall fragment (Figure 
1 ) cloned in pNP3 directed the production of proteins of about 34,000 and 38,000 daltons, whereas the 1 .6 kb Eco RV 
fragment cloned in pNP5 in the same orientation (Fig. 1) directed production of just the 34,000 dalton protein. The two 
proteins appear to be different forms of the Epr protease, resulting from either processing or proteolytic degradation. 

10 Clearly, the 1.6 kb Eco RV fragment, which lacks the 3' third of the ep_r gene, is capable of directing the production of 
an active protease similar in size to that observed when the entire gene is present. This suggests that the protease 
normally undergoes C-terminal processing. 

Bacillus strain GP208 containing the ep_r gene on plasmid pNP3 can be used to overproduce the Epr protease, 
which can then be purified by conventional procedures. 

15 

Location of epr on the B. subtilis chromosome 

To map ep_r on the B. subtilis chromosome, we introduced a drug-resistance marker into the chromosome at the 
site of the epj gene, and used phage PBS 1 -mediated transduction to determine the location of the insertion. A 1 .3 kb 

20 Eco RI fragment containing a chloramphenicol acetyttransferase (cat ) gene was cloned into the unique EcoRI site on 
an E. coli plasmid containing the epr gene (pNP2 is depicted in Figure 7). The resulting plasmid (pNP7) was used to 
transform B. subtilis GP208, and chloramphenicol resistant transformants were selected. Since the plasmid cannot 
replicate autonomously in B. subtilis , the Cm r transformants were expected to arise by virtue of a single, reciprocal 
recombination event between the cloned epr gene on the plasmid and the chromosomal copy of the gene. Southern 

£5 hybridization confirmed that the cat gene had integrated into the chromosome at the site of the cloned ep_r gene. 
Mapping experiments indicated that the inserted cat gene and ep_r gene are tightly linked to sac A321 (77% co-trans- 
duction), are weakly linked topurA16 (5% co-transduction), and unlinked tohisAI. These findings suggest that the epr 
gene is located near sac A in an area of the genetic map which does not contain any other known protease genes. 

30 Construction of epr Deletion Mutant 

To create a mutant Bacillus devoid of protease activity a deletion in the 5' end of the cloned gene was constructed 
and then used to replace the wild type gene in the chromosome. pNP2 was first digested with Bam HI. which cleaves 
at a unique site within the ep_r gene, then the linear plasmid DNA was treated with Bal31 exonuclease for 5 minutes at 

35 32°C, religated and transformed into JE. coli JM107. Plasmid DNA was isolated from 20 transformants, digested with 
Eco RI and Hin dlll to remove the egr gene insert and analyzed by gel electrophoresis. One of the plasmids had a 2.3 
kb Eco RI-HindHI fragment replacing the 2.7 kb fragment indicating that approximately 400 base pairs had been deleted 
from the epr gene sequence. This plasmid was designated pNP8 (Fig. 7). This deletion mutant was introduced into B. 
subtilis GP208 by gene replacement methods as described above. The cat gene, contained on an Eco RI fragment 

40 from pEccI, was introduced into the Eco RI site on pNP8 to create pNP9 (Fig. 7). This E. coli plasmid was used to 
transform _B. subtilis GP208 and Cm r colonies were selected. Most of the transformants produced a very small halo 
and the remaining 30% produced no halos on casein plates. The absence of a halo and therefore protease activity 
resulted from a double crossover between chromosomal DNA and homologous sequences from a concatemer of the 
plasmid DNA; these strains contain the E. coli replicon and cat gene flanked by two copies of the deleted ep_r gene. 

45 To screen for a strain that had undergone a recombination event between the two copies of the epr gene to resolve 
the duplication, but which had jettisoned the cat gene and the coli replicon, a single colony was selected and grown 
overnight in rich medium without drug selection. Individual colonies arising from this culture were then screened for 
drug resistance and about 0.1% of these were found to be Cm 3 . One such strain, GP216, containing deletions within 
the four protease genes (ap_r, np_r, jsg-1 and epr ) was selected for further study. 

50 The deletion in the chromosomal ep_r gene was confirmed by Southern hybridization. GP216, like the Cm r parent 

strain, failed to produce a halo on casein plates. In liquid cultures, however, l4 C-casein protease assays indicated that 
the ep_r mutation alone does not entirely eliminate residual protease activity. A strain with deletions in ep_r, ap_r, npr , 
and jsg, did not produce significantly less protease than a strain with mutations in just ap_r, npr , and jsj>. Finally, growth 
and sporulation of the quadruple protease deleted strain were assayed using standard laboratory media. No differences 

ss W ere observed in growth in LB medium when compared to the wild-type strain. Similarly, no appreciable differences 
were seen in sporulation frequency after growth on DSM medium for 30 hours (1 X 10 8 spores/ml for both GP208 and 
GP216). 
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Identification of Novel Proteolytic Activities 

Strains of B. subtilis have been deleted for four non-essential protease genes, agr, npr , isp -1 and epj\ These 
deletions reduce total extracellular protease levels in culture supematants of Spo+ hosts by about 96% compared to 
the wild-type strain, but it is desirable to decrease or eliminate the remaining 4% residual protease activity for the 
production of protease-labile products in Bacillus . 

Using the azacoll assay, we have identified two novel proteases that account for this residual activity in GP227, a 
multiple protease deficient B. subtilis strain fapr A , nprA, eprA, isp-1 A) which also contains a gene, sacQ\ encoding a 
requlatory protein. The sacQ* gene product functions by enhancing the production of degradative enzymes in Bacillus , 
including the residual protease activity(s) as described in our European Patent Application 86308356.4 (Publication 
No. EP-A-0227260) the disclosure of which is to be regarded as hereby incorporated by reference. Due to enhancement 
by sacQ*, strain GP227 produces substantially more protease activity than GP216, which lacks sacQ*. 

In general, supematants from cultures of B. subtilis GP227 were concentrated, fractionated by passage over a gel 
filtration column and assayed for protease activity. Two separate peaks of activity were eluted from the column and 
designated RP-l and RP-II (residual protease) for the larger and smaller molecular weight species, respectively Sub- 
sequent analysis of these two peaks confirmed that each accounted for a distinct enzymatic activity. The isolation and 
characterization of the RP-l and RP-II proteins, and the creation of a deletion mutation in each of the RP-1 and RP-l I 
genes are described below. 

Isolation and Characterization of RP-l 

A simple and efficient purification scheme was developed for the isolation of RP-l from spent culture fluids. Cultures 
were grown in modified MRS lactobacillus media (Difco, with maltose substituted for glucose) and concentrated ap- 
proximately 10-fold using an Amicon CH2PR system equipped with a S1Y10 spiral cartridge. The concentrated super- 
natant was dialyzed in place against 50mM MES, 0.4M NaCI, pH 6.8, and fractionated over a SW3000 HPLC gel 
filtration column equilibrated with the same buffer. The fractions containing protease activity were identified using a 
modification of the azocoll assay described above. 

Fractions which were positive for the protease activity, corresponding to the higher molecular weight species, were 
pooled and concentrated using a stirred cell equipped with a YM5 membrane, dialyzed vs. 50mM MES, 100mM KCI, 
pH 6.7 and applied to a benzamidine-Sepharose liquid affinity column equilibrated with the same buffer. Most of the 
protein applied to the column (97%) failed to bind to the resin, however RP-l protein bound quantitatively and was 
eluted from the column with 250mM KCI. 

SDS-PAGE analysis of the benzamidine purified RP-I revealed that the protein was greater than 95% homogene- 
ous, and had a molecular weight of approximately 47,000 daltons. Purification by the above outlined procedure resulted 
in a 140-fold increase in specific activity, and an overall recovery of about 10%. 

Isoelectric focusing gels revealed that RP-l has a pi between 4.4 and 4.7, indicating a high acidic/basic residue 
composition. The enzyme has a pH optimum of 8.0 and a temperature maximum of 60° C when azocoll is used as the 
substrate. It is completely inhibited by PMSF, indicating that it is a serine protease, but it is not inhibited by EDTA, even 
at concentrations as high as 50mM. 

RP-l catalyzes the hydrolysis of protein substrates such as denatured collagen and casing as well as ester sub- 
strates (0=C-O- vs. 0=C-N- linkages) such as N-a-benzolyl-L-arginine ethyl ester, phenylalanine methylester, tyrosine 
ethyl ester and phenylalanine ethyl ester, but does not catalyze hydrolysis of the arginine peptide bond in the synthetic 
substrate N-a-benzoyl-L-arginine-4-nitranilide. Collectively, these data demonstrate that RP-l is a serine endoprotei- 
nase that has esterase activity and belongs to the subtilisin superfamily of serine proteases. Furthermore, these char- 
acteristics indicate that RP-l may be the enzyme commonly referred to as Bacillopeptidase F (Boyer et al., 1968, Arch 
Biochem, Biophys., T28:442 and Roitsch et al., 1 983, J. Bact., 155: 1 45). Although Bacillopeptidase F has been reported 
to be a glycoprotein, we have not found carbohydrate to be associated with RP-l. 

Cloning the Gene for RP-l 

The sequence of the amino-terminal 28 amino acids of RP-l was determined by sequential Edman degradation 
on an automatic gas phase sequenator and is depicted in Figure 8. A DNA probe sequence (81 nucleotides) was 
synthesized based on the most frequent codon usage for these amino acids in B. subtilis (Figure 8). The N-terminal 
amino acid sequence of RP-l contains two tryptophan residues (positions 7 and 18). Since tryptophan has no codon 
degeneracy, this facilitated the construction of a probe that was highly specific for the gene encoding RP-l. 

High molecular weight DNA was isolated from B. subtilis strain GP216, digested with each of several different 
restriction endonucleases and fragments were separated by electrophoresis through a 0.8% agarose gel. The gel was 
blotted onto a nitrocellulose filter by the method of Southern (supra ) and hybridized overnight with the ^P end-labeled 
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synthetic RP-I specific probe under semi-stringent conditions (5X SSC, 20% formamide, IX Denhardts at 37° C). 
Following hybridization, the blot was washed for one hour at room temperature in 2X SSC, 0.1% SDS. 

The RP-I specific probe hybridized to only one band in each of the restriction digests indicating that the probe was 
specific for the RP-I gene. In the Pstl digest, the probe hybridized to a 6.5 kb fragment which was a convenient size 
5 for cloning and was also large enough to contain most or all of the RP-I gene. 

A clone bank containing Pstl inserts in the 6-7 kb size range was prepared from B. subtilis DNA as follows. Chro- 
mosomal DNA of strain GP216 was digested with Pstl and separated on a 0.8% agarose gel. DNA fragments of 6-7 
kb were purified from the gel by electroelution and ligated with Pstl digested pBR322 that had been treated with calf 
intestinal phosphatase to prevent recircularization of the vector upon treatment with ligase. The ligated DNA was trans- 
Jo formed into competent E. coli DH5 cells and plated on media containing tetracycline. Approximately 3 x 10 4 Tet r trans- 
formants resulted, 80% of which contained plasmids with inserts in the 6-7 kb size range. 

A set of 550 transformants was screened for the presence of the RP-I insert by colony hybridization with the 32 P- 
labeled RP-I specific probe and seven of these transformants were found to hybridize strongly with the probe. Plasmid 
DNA was isolated from six of the positive clones and the restriction digest patterns were analyzed with Pstl and Hin dlll. 
is All six clones had identical restriction patterns, and the plasmid from one of them was designated pCRB3. 

Using a variety of restriction enzymes, the restriction map of pCR83 insert shown in Figure 9 was derived. The 
RP-I oligomer probe, which encodes the N-terminal 28 amino acids of the mature RP-I protease, was hybridized with 
restriction digests of pCR83 by the method of Southern (supra ). The probe was found to hybridize with a 0.65 kb Cla l- 
EcoRV fragment suggesting that this fragment contained the 5' end of the gene. In order to determine the orientation 
20 of the RP-I gene, the strands of the Cla l -Eco RV fragment were separately cloned into the single-stranded phage M1 3. 
The M13 clones were then probed with the RP-I oligomer and the results indicated that the RP-I gene is oriented in 
the leftward to rightward direction according to the map in Figure 9. 

The DNA sequence of a portion of the Pst l insert, as shown in Figure 9, was determined, and an 81 base pair 
sequence (underlined in Figure 1 0) was found that corresponded exactly with the sequence encoding the first 28 amino 
25 acids of the protein. The_BgJII and Clal sites designated in Fig. 10 are identical to those designated in Fig. 9 and, in 
addition, the Eco RV site is identical to that designated in the restriction enzyme map shown In Fig. 9. Portions of the 
untranslated region surrounding the RP-I coding region are also shown in Fig. 10; the DNA sequence underlined within 
the 5' untranslated region corresponds to the putative ribosome binding site. 

The DNA sequence revealed an open reading frame that began at position-1 5 (in Figure 1 0) and proceeded through 
-30 to position 2270. The most probable initiation codon for this open reading frame is the ATG at position 1 in Figure 10. 
This ATG is preceded by a ribosome binding site (AAAGGGGGATGA), which had a calculated AG of -17.4 kcal. The 
first 29 amino acids following this Met resemble a B. subtilis signal sequence, with a short sequence containing five 
positively-charged amino acids, followed by 16 hydrophobic residues, a helix-breaking proline, and a typical Ala-X-Ala 
signal peptidase cleavage site. After the likely signal peptidase cleavage site, a "pro" region of 164 residues is followed 
35 by the beginning of the mature protein as confirmed by the determined N-terminal amino acid sequence. The first 
amino acid of the N-terminus, which was uncertain from the protein sequence, was confirmed as the Ala residue at 
position 583-585 from the DNA sequence. The entire mature protein was deduced to contain 496 amino acids with a 
predicted molecular weight of 52,729 daltons. This size was in reasonable agreement with the determined molecular 
weight of the purified protein of 47,000 daltons. In addition, the predicted isoelectric point of the mature enzyme (4.04) 
40 was in good agreement with the observed pi of 4.4-4.7. GENBANK revealed that the RP-I gene is partially homologous 
(30%) to subtilisin, to ISP-1 and, to a lesser extent (27%), to the ep_r gene product. 

Cloning the RP-I oene on a multicopy replicon 

45 The Pstl fragment was removed from pCR83 and ligated into Pstl linearized pBD9, a multicopy Bacillus replicon 

encoding erythromycin and kanamycin resistances. The ligated DNA was transformed into competent GP227 cells 
(the sacQ* enhancement strain) and kanamycin resistant transformants were selected. A plasmid carrying the 6.5 kb 
Pst l insert was chosen and designated pCR88. 

To confirm that this insert encoded the RP-I gene, GP227 cells containing pCR88 or pBD9 were grown in MRS 

50 medium under selective conditions for 50 hours at 37° C. Supernatant samples were collected and assayed for protease 
activity. Supernatants from the pCR88 cultures contained approximately 10-fold more protease activity than those from 
the pBD9 cultures. Furthermore, this secreted protease activity was inhibited by PMSF and, when fractionated on a 
denaturing protein gel, the supernatant from the pCR88 sample contained an extra protein of 47 kd. These results 
confirmed that the RP-I gene was encoded within the 6.5 kb fragment, and that cloning the sequence in a multicopy 

55 replicon leads to the overproduction of the RP-I protein. 
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Location of the RP-t Gene on the B. Subtilis Chromosome 

We mapped the location of the RP-I gene (bpr) on the B. subtilis chromosome by integrating a drug resistance 
marker into the chromosome at the site of bpr and using phage PBS 1 -mediated transduction to determine the location 

$ of the cat insertion. A 1 .3 kb Sma l fragment containing a chloramphenicol acety transferase (cat) gene was cloned into 
the unique EcoRV site of pCR92 (the 3.0 kb Bglll of pCR83 cloned into pUC18. The EcoRV site is in the coding region 
of bpr (Figure 10). The resulting plasmid, pAS11 2, was linearized by digestion with EcoR1 and then used to transform 
B. subtilis strain GP216, andchloramphenicol-resistant transformants were selected (GP238). Cm r transformants were 
expected to be the result of a double cross-over between the linear plasmid and the chromosome (marker replacement). 

10 Southern hybridization was used to confirm that the cat gene had integrated in the chromosome, interrupting the bpr 
gene. Mapping experiments indicating that the inserted cat gene and bp_r were strongly linked to£yrD1 (89%) and 
weakly linked to met C (4%). The gene encoding the neutral protease gene (npr) also maps in this region of the chro- 
mosome, although npr is less tightly linked to£Yj (45% and 32%) and more tightly linked to met C (18% and 21%) than 
is bpr. 

15 

Construction of a deleted version of the RP-I gene 

An internal deletion in the RP-I sequence was generated in vitro . Deletion of the 650 bp sequence between the 
Cla l and Eco RV sites in the pCR83 insert removed the sequence encoding virtually the entire ami no -terminal half of 

20 the mature RP-I protein. The deletion was made by the following procedure. 

The 4.5 kb Pstl -Eco RI fragment of PCR7B (a pBR322 clone containing the 6.5 kb Pstl fragment) was isolated and 
ligated to pUC18 (a vector containing the _E. coli JacZ gene encoding p-galactosidase) that had been digested with 
Eco RI and Pstl. The ligation mix was then transformed into E. coli DH5 cells. When plated onto LB media containing 
Xgal and ampicillin, eight white colonies resulted, indicating insertion of the fragment within the gene encoding 0- 

25 galactosidase. Plasmid DNA prepared from these colonies indicated that seven of the eight colonies contained plasmids 
with the 4.5 kb insert. One such plasmid, pKT2, was digested with Eco RV and Cla l, treated with Klenow fragment to 
blunt the C]§l end and then recircularized by self-ligation. The ligated DNA was then transformed into E^ colj DH5 cells. 
Approximately 100 transformants resulted and plasmid DNA was isolated from Amp r transformants and analyzed by 
restriction digestion. Eight of eight clones had the Clal -Eco RV fragment deleted. One such plasmid was designated 

30 pKT2'. The cat gene, carried on an Eco RI fragment from pEccI was then ligated into pKT2 / for use in selecting Bacillus 
integrants as described above. To insert the cat gene, pKT2 / was digested with Eco RI , treated with calf intestine alkaline 
phosphatase and ligated to a 1 .3 kb Eco RI fragment containing the cat gene. The ligated DNA was transformed into 
DH5 cells and the Amp r colonies that resulted were patched onto LB media containing chloramphenicol. Two of 100 
colonies were Cm r . Plasmid DNA was isolated from these two clones and the presence of the 1 .3 kb cat gene fragment 

35 was confirmed by restriction enzyme analysis of plasmid DNA. One of these plasmids, pKT3, was used to introduce 
the deleted gene into strain GP216 by gene replacement methods. 

The DNA was transformed into GP216 and chloramphenicol resistant colonies were selected. Chromosomal DNA 
was extracted from 8 Cm R colonies and analyzed by Southern hybridization. One clone contained two copies of the 
deleted RP-I gene resulting from a double crossover between homologous sequences on the vector and in the chro- 

40 mosome. The clone was grown in the absence of chloramphenicol selection and was then replica plated onto TBAB 
media containing chloramphenicol. One Cm 3 colony was isolated and Southern analysis confirmed that the deleted 
gene had replaced the wild-type RP-I gene in the chromosome. This strain was designated GP240. Analysis of super- 
natants from cultures of GP240 confirmed the absence of RP-I activity. 

45 Isolation and Characterization of RP-I I 

The purification scheme for RP-II was more extensive than for RP-l because RP-II failed to bind benzamidine- 
Sepharose or other protease-affintty resins, e.g., arginine-Sepharose and hemoglobin-agarose, and we thus found it 
necessary to use more conventional purification techniques such as ion exchange chromatography, gel filtration and 

so polyacrylamide gel electrophoresis. 

Concentrated crude supernatants of GP227 cultures were fractionated over DEAE-Sephacel (anion exchange) 
equilibrated at pH 6.8. At this pH the RP-II protein failed to bind the resin; however, approximately 80% of the total 
applied protein, including RP-l, bound the resin and was thus removed from the sample. The column eluate was then 
fractionated by cation exchange chromatography using CM-Sepharose CL-6B equilibrated at pH 6.8. RP-II was capable 

55 of binding to the resin under these conditions and was then eluted from the column with 0.5 M KCI. To further enhance 
the resolution of the cation exchange step, the RP-II eluate was then refract ionated over a 4.6 x 250 mm WCX (weak 
cation exchange) HPLC column developed with a linear gradient of NaCI. The WCX pool was then size-fractionated 
over a TSK-125 HPLC column. The RP-II peak was then fractionated a second time over the same column yielding a 
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nearly homogeneous preparation of RP-II when analyzed by SDS-PAGE. The protease was purified over 6900-fold 
and represented approximately 0.01% of the total protein in culture fluids of GP227. Alternatively, approximately 30 
fold more RP-II can be purified from a Bacillus strain that is RP-h and contains the sacQ* enhancing sequence (U.S. 
S.N. 921 ,343, assigned to the same assignee and hereby incorporated by reference), since the quantity of RP-II pro- 

5 duced by such a strain is substantially increased, representing about 0.3% of total protein in the culture fluid. 

RP-II was insensitive to PMSF treatment, and therefore is not a serine protease. SDS-PAGE analysis indicated 
that RP-II has a molecular mass of 27.3 kd. The failure of RP-II to bind DEAE at pH 6.7 and PAE-300 (an HPLC anionic 
column) at pH B.3 indicated that the protein has a basic isoelectric point which is greater than 8.3 (pi = 6.7 by chro- 
matofocusing). RP-II is highly sensitive to dithiothreitol (DTT, a sutfhydryl reducing agent), being quantitatively inhibited 

10 at levels as low as 1 mM in the azocoll assay. RP-II is also sensitive to combinations of other sutfhydryl reagents with 
metal chelators (i.e., mercaptoethanol with EDTA). Inhibition of proteases by sutfhydryl reagents is relatively rare and 
has only been described for a few proteases, such as collagenase from C. histolyticum and carboxypeptidase A. RP- 
II also possesses esterase activity as demonstrated by its ability to hydrolyze phenylalanine methyl ester and n-t-BOC- 
L-glutamic acid-a-phenyl ester. 

*s In order to obtain the cleanest possible sample of RP-II for sequence analysis, a final purification step was used 

which involved separation by polyacrylamide gel electrophoresis. Following electrophoresis, proteins were transferred 
electrophoretically from the gel to a sheet of polyvinylidene difluoride (PVDF) membrane. RP-II was visualized on the 
hydrophobic membrane as a "wet-spot" and the corresponding area was cut from the sheet and its amino-terminal 
amino acid sequence determined. 

20 The sequence of the 15 amino acid terminal residues of RP-II (Ser-lle-lle-Gly-Thr-Asp-Glu-Arg-Thr-Arg-lle-Ser- 

Ser-Thr-Thr-) is rich in serine and arginine residues. Since both serine and arginine have a high degree of codon 
degeneracy, this increased the difficulty in creating a highly specific probe. Therefore, additional amino acid sequence 
information was obtained from internal peptides that contained one or more non-degenerate amino acid residues. 

25 Sequence Analysis of Internal Peptide Fragments of RP-lt 

Tryptic peptides from purified RP-II were produced and isolated using reverse-phase HPLC. Since each of the 
amino acids tryptophan and methionine is encoded by only one amino acid codon, a synthetic nucleotide probe, or 
"guess-mer" that encodes one or more of either of these amino acids will be highly specific for its complementary 
30 nucleotide sequences. 

An HPLC chromatograrn of the RP-II trypsin digested mixture was monitored at three wavelengths: 21 0 nm (peptide 
bonds), 227 nm (aromatic residues, i.e., phenylalanine, tyrosine, tryptophan), and 292 nm (conjugated ring structure 
of tryptophan). The 292 nm trace was used to identify peptides of RP-II that contain a tryptophan residue. The 210 nm 
trace was used to obtain baseline resolved (i.e., single-species peptides) fragments for sequence analysis. Based on 
35 the 210 nm and 292 nm traces, three fragments were chosen for sequence analysis: T90, T94, and T92. Guess-mer 
oligomers were then synthesized based on the amino acid sequences of these fragments. 

Figure 1 1 (a) is the amino-terminal sequence obtained for RP-II fragment T90. A total of 1 5 residues were obtained, 
67% of which have onty one or two possible codons. The specificity of a probe (BRT90) constructed based on the 
sequence of fragment T90 was enhanced by the presence of a predicted tryptophan residue (position 12). The number 
40 jn parentheses at each position represents the possible number of codons for each residue. 

The amino-terminal sequence of RP-II fragment T94 is shown in Figure 11(b). Of the 30 residues determined, 
none were found to be tryptophan. Although only 36% of the residues (numbers 1-25) have two possible codons, the 
length of the corresponding 75-mer probe (707) renders it useful for corroborating hybridization experiments conducted 
with the T90 probe. 

4£ The third and final probe was constructed based on sequence information obtained from RP-II fragment T92 (Fig. 

11(c)). Because of the relatively high degree of degeneracy at the beginning and end of this sequence, a probe was 
constructed based on residues 15-27. The resulting 39-mer probe (715) codes for a peptide of which half the residues 
have only one or two possible codons. Furthermore, the specificity of this probe was enhanced by the tandem location 
of a methionine and tryptophan residue at positions 26 and 27. 



so 



Cloning of RP-II 



Chromosomal DNA was cut with various restriction enyzmes and a series of hybridizations using the radiolabeled 
oligomer probes BRT90 and 707 were performed. Both probes were labelled with 32 P and hybridized to a Southern 
5S blot of GP241 DNA digested with Bam HI, Bglll, Hindi, Pst l, or EcoRI under semi-stringent conditions (5 x SSC, 10% 
formamide, 1 x Denhardt's, 100 u,g/ml denatured salmon sperm DNA at 37°C). After hybridization for 18 hours, the 
blots were washed with 2 x SSC, 0.1% SDS for one hour at 37°C, and then washed with the same buffer at 45°C for 
one hour. The results are shown in Fig. 12. Both probes hybridized to the same restriction fragments: Hind I, -1 kb; 
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Pst l. 3-4 kb, and Eco RI, 6-7 kb. The probes also hybridized to very large fragments in the Bam HI and BgJIl -digested 
DNAs. 

Pst l fragments of 3-4 kb were used to construct a DNA library, as follows. pBR322 was digested with Pstl and 
treated with CIAP. Size-selected Pstl-digested GP241 chromosomal DNA of 3-4.5 kb was electroeluted from a 0.8% 
5 agarose gel. Approximately 0.1 u~g of Pstl-cut pBR322 and 0.2 u.g of the size-selected DNA was ligated at 16°C over- 
night. The ligated DNA was then transformed into^. coli DH5 cells. Approximately 10,000 colonies resulted, of which 
60% contained plasmids with the insert DNA. 1400 colonies were patched onto LB plates containing 15 u.g/ml tetra- 
cycline with nitrocellulose filters. After colonies were grown at 37°C overnight, the filters were processed to lyse the 
colonies, denature the DNA, and remove cell debris. The filters were then baked at 80° for two hours. Colony hybrid- 

io ization was performed using radiolabeled probe 707. Hybridization conditions were identical to those used in the South- 
ern blot experiments. Analysis of the plasmid DNA from four positive colonies identified one as containing plasmid 
DNA that contained a 3.6 kb insert which strongly hybridized to both probes. The plasmid, pLP1 , is shown in Fig. 1 3(b). 

A restriction map of pLP1 (Fig. 1 3(a)) was constructed using a variety of restriction endonucleases to digest pLP1 , 
transferring the size-fractionated digests onto nitrocellulose, and probing the immobilized restriction fragments with 

*s the radiolabeled oligomers described above, it was determined that all three oligomers, which encode a total of 53 
amino acids within the RP-ll protein, hybridized with the 1.1 kb Hindi fragment. 

The 1.1 kb Hindi fragment was isolated and cloned into M13mp18. A phage clone containing the Hindi fragment 
was identified by hybridization with one of the oligomer probes. The DNA sequence of the Hind i fragment revealed 
an open reading frame that spanned most of the fragment (position -24 to position 939 in Figure 1 4). The most probable 

20 initiation codon for this open reading frame is the ATG at position 1 in Figure 14. This ATG is preceded by a B. subtilis 
ribosome binding site (AAAGGAGG), which has a calculated AG of -16.0 kcal. The first 33 amino acids following this 
Met resembled a B. subtilis signal sequence, with a short sequence containing four positively-charged amino acids, 
followed by 18 hydrophobic residues, a helix-breaking proline, and a typical Ala-X-Ala signal peptidase cleavage site. 
After the presumed signal peptidase cleavage site, a "pro" region of 58 residues is found, followed by the beginning 

2S of the mature protein as determined by the N-terminal amino acid sequence of the purified protein. The amino terminal 
16 residues are underlined and designated "N terminus". Amino acid sequences from which the three guess-mers 
were deduced are also underlined and designated T94, T92, and T90. The determined amino acid sequences of the 
peptides matched the deduced amino acid sequence except for a serine residue encoded by nucleotides 379-381 and 
a cysteine residue encoded by nucleotides 391 -393. The determined amino acid sequence predicted a cysteine residue 

30 (position 14, T94 peptide) and an asparagine residue (position 18, T94 peptide), respectively (Figure 11). The entire 
mature protein was deduced to contain 221 amino acids with a predicted molecular weight of 23,941 daltons. This size 
was in approximate agreement with the determined molecular weight of the purified protein 28,000 daltons. 

The deduced amino acid sequence showed only limited homology to other sequences in GENBANK. The strongest 
homology was to human protease E and bovine procarboxypeptidase A in a 25 amino acid sequence within RP-ll 

35 (131-155, encoded by nucleotides 391-465; Figure 14). 

To further confirm the identity of the RP-ll gene, the 3.6 kb Pstl fragment was engineered onto a multi-copy Bacillus 
replicon to test for overproduction of the RP-ll protein. For this purpose the Bacillus plasmid pBs81/6 (Cm r , Neo r ) was 
inserted into the E. coli clone containing the RP-ll gene. Plasmid pLP1 (8.0 kb) was digested with Eco RI, which cuts 
at a single site outside the Pstl insert, and ligated to Eco RI -digested pBs81/6 (4.5 kb;.Fig. 1 3(a)). The resulting plasmid 

to (pCR130) was used to transform GP241, and chloramphenicol or neomyc in-resistant transformants were selected. 
Supernatant samples from cultures of the transformants were found to contain 3-4 fold more azocoll-hydrolyzing activity 
than the supernatants from cells containing only the plasmid pBs81/6, indicating that the gene lor RP-ll is wholly con- 
tained within the 3.6 kb Pstl fragment. 

45 Location of the RP-ll Gene on the B. subtilis chromosome 

In order to map the RPII gene (mpr) on the B. subtilis chromosome, we used B. subtilis strain GP261 described 
below which contained the cat gene inserted into the chromosome at the site of the mpr gene and used phage PBS1 
transduction to determine the location of the cat insertion. 
50 Mapping experiments indicated that the inserted cat gene and mpr were linked to cysA14 (7% co-transduction) 

and to arol906 (36% co-transduction) but unlinked to purA16 and dal. This data indicated that the mpr gene was 
between cysA and arol in an area of the genetic map not previously known to contain protease genes. 



55 



Deletion of the RP-ll Gene on the Bacillus Chromosome 

As described above for the other Bacillus subtilis proteases, an RP-ll Bacillus deletion mutant was constructed by 
substituting a deleted version of the RP-ll gene for the complete copy on the chromosome. To ensure the deletion of 
the entire RP-ll gene, a region of DNA was deleted between the two Hpa l sites in the insert (Fig. 13(a)). This region 
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contains the entire 1.1 kb Hindi fragment and an additional 0.9 kb of DNA upstream of the Hindi fragment. 

To create the deletion, plasmid pLPI (the pBR322 clone containing the 3.6 kb Pstl fragment) was digested with 
Hpa l and size-fractionated on an agarose gel. Digestion of pLP1 results in the release of the 2 kb internal Hpa l fragment 
and a larger Hpa l fragment containing the vector backbone and segments that flank the Pstl insert (Fig. 13(c)). The 
larger Hpa l fragment was purified and ligated with purified blunt-ended DNA fragments containing either the chloram- 
phenicol-resistance (cat) gene from pMI1101 (Youngman et al , 1984, supra ) or the bleomycin resistance (ble) gene 
from pKT4, a derivative of pUB110 (available from the Bacillus Stock Center, Columbus Ohio). 

The cat gene was isolated as a 1 .6 kb Sma l fragment from pEcd . This DNA was ligated to the isolated large Hpa l 
fragment of pLP1 . The ligated DNA was then transformed into E. coli DH5 cells. Approximately 20 Tet r colonies resulted. 
One colony was found to be Cm r when the colonies were patched onto LB medium + 5 u,g/ml chloramphenicol. Analysis 
of the plasmid DNA from this colony confirmed the presence of the cat gene. This plasmid was called pLP2. 

Plasmid pLP2 (Fig. 13(c)) was digested with Pstl and then transformed into GP241. This transformation gave 
approximately 280 Cm r colonies; one colony was chosen for further study (GP261). Competent cells of GP261 were 
prepared and then transformed with pDP104 (sacQ*); 10 Tet r colonies resulted. Four colonies were grown in MRS 
medium and the presence of sacQ* was confirmed by elevated levels of aminopeptidase. This strain was called GP262. 

Since the cat gene was often used to select other vectors, a different antibiotic resistance was also used to mark 
the deletion of the RP-II gene on the Bacillus chromosome; i.e., the bleomycin-resistance gene of pUB110. The ble 
gene was isolated from plasmid pKT4, a derivative of pUB110, as an Eco RV -Sma l fragment and ligated to the purified 
large Hpa l fragment (Fig. 1 3(c)) before tranformation into E. coli DH5 cells; tetracycline-resistant transformants were 
selected and then screened for resistance to phleomycin, a derivative of bleomycin, by patching onto TBAB plates 
containing phleomycin at a final concentration of 2 u.g/ml. Of 47 Tet r transformants so screened, seven were also 
phleomycin-resistant. The insertion of the ble gene was confirmed by restriction analysis of the plasmids isolated from 
these clones. One of these plasmids, pCR125 (Fig. 1 3(c)), was used to introduce the deleted gene containing the ble 
gene marker into the strain GP241 by gene replacement methods, as described below. 

Plasmid pCR1 25 was digested with Eco RI and the linear plasmid DNA was used to transform GP241 to phleomycin 
resistance. Resistant transformants were selected by plating the transformed cells onto TBAB agar plates containing 
a gradient of 0-5 u.g/ml phleomycin across the plate. Transformants that were resistant to approximately 2.5 u.g/ml 
phleomycin on the plates were single-colony purified on TBAB phleomycin plates and thereafter grown on TBAB without 
selective antibiotic (strain GP263). 

The strains bearing the RP-I I deletion and the cat or ble insertion in the RP-I I gene, along with the positive regulatory 
element, sacQ*, were evaluated for extracellular enzyme production, particularly protease and esterase activities. 

The data given in Table 1 , below, indicate that the presence of sacQ* in B. subtilis strain GP239, which bears null 
mutations in the five protease genes ap_r (subtilisin), np_r (neutral protease), ep_r (extracellular protease), jsfi (internal 
serine protease), and bpr, enhanced production of the RP-II protease (which also has esterase activity). To assess the 
influence on protease production of deleting RP-II from strains of B. subtilis bearing the sacQ* regulatory element, the 
following experiments were performed. 

Independent clones of the RP-II deletion strain GP262 were shown to produce negligible amounts of esterase 
activity and no detectable levels of endoprotease activity using azocoll as substrate (Table I). To confirm the absence 
of protease activity, culture supernatants from GP262 were concentrated to the extent that the equivalent of 1 ml of 
supernatant could be assayed. Even after 2.5 hours incubation of the equivalent of 1 ml of supernatant with the azocoll 
substrate, there was no detectable protease activity in the deleted RP-II strain. By comparison, 50uJ of supernatant 
from GP239 typically gave an A^ in the azocoll assay of over 2.0 after a one hour incubation at 55°C. (The presence 
of sacQ* was confirmed by measurement of the levels of aminopeptidase present in the culture fluids of this strain, 
which were 50-80 fold higher than in analogous strains lacking sacQ*.) Thus, deletion of the two residual proteases, 
RP-I and RP-II, in Bacillus yields a strain that is largely incapable of producing extracellular endoproteases, as meas- 
ured using azocoll as a substrate under the conditions described above. 



Table 1 



so 



ss 



Strain 


Aminopeptidase 


Protease 


Esterase 


(U/ml) 


(U/ml) 


(U/ml) 


GP238 


0.04 


0.13 


0.02 


GP239 


1.7 


84 


1.16 


GP262, Al 


2.9 


ND 


0.08 


GP262, All 


3.4 


ND 


0.11 


GP262, Bl 


1.9 


ND 


0.10 


GP262.BII 


2.5 


ND 


0.10 
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Aminopeptidase was measured using L-leucine-p-nitroanilide as substrate (1 unit = u,mols substrate hydro- 
lyzed/minute). Protease was measured using the standard azocoll assay (1 unit = AA 520 of 0.5/hour). Esterase was 
measured using N-t-BOC -glutamic acid-a-phenyl ester as substrate (1 unit = pjnols substrate hydrolyzed/minute). 
Strain GP238 has the genotype Aapr, Anpr , Aepr , Aisp, Arg-1 ; strain GP239 has the genotype Aapr , Anpr, Aepr , Aisp, 

s Arp-1 , sac Q*; and GP262 Al, Alt, Bl.and Bll are independent clones of GP262 containing sacQ* and a cat insertional 
deletion in RP-IL ND means not detectable. 

Referring to Table 2, several protease-deficient strains were also tested for protease activity using the more sen- 
sitive resorufin-labelled casein assay described earlier. As is shown in Table 2, although the strain GP263, deleted for 
six protease genes, exhibited no detectable protease activity in the azocoll test, such activity was detected in the 

10 resorufin-labelled casein test. GP271, the spoOA derivative of GP263, exhibited no detectable protease activity in 
either test, indicating that the prior protease activity detected in GP263 may be under sporulation control. The minor 
casein-detectable activity present in culture fluids of GP263 apparently belongs to the serine protease family, because 
of its sensitivity to inhibition by PMSF. In the presence of PMSF, no detectable protease activity was present in cultures 
of GP263. 

15 
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Table 2 



Remaining activity 
(% of wild-type at tio) 



Strain Genotype 1 2 

IS75 Wild-type 100 100 

GP202 &apr, Anpr, amyE 5 8 

-GP208 Aapr, Anpr, Aisp-l, amyE, met" 5 8 

GP263 Aapr, Anpr, Aisp-1, tepr, Abpr, ND 0.5-1 
Ampr, Ahpr, amyE, met~ 

GP271 spoOA, Aapr, Anpr, Aisp-1, Aepr, ND ND 
Abpr, Ampr, Ahpr, amyE, met" 



1 As measured using azocoll as substrate. 

2 As measured using resoruf in casein as substrate. 

Other embodiments are feasible. 

For example, in some instances it may be desirable to express, rather than mutate or delete, a gene or genes 
encoding protease(s). 

This could be done, lor example, to produce the proteases for purposes such as improvement of the cleaning activity 
of laundry detergents or for use in industrial processes. This can be accomplished either by inserting regulatory DNA 
(any appropriate Bacillus promoter and, rf desired, ribosome binding site and/or signal encoding sequence) upstream 
of the protease-encoding gene or, alternatively, by inserting the protease-encoding gene into a Bacillus expression or 
secretion vector; the vector can then be transformed into a Bacillus strain for production (or secretion) of the protease, 
which is then isolated by conventional techniques. Alternatively, the protease can be overproduced by inserting one 
or more copies of the protease gene on a vector into a host strain containing a regulatory gene such as sacQ*. 
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Claims 



1. A Bacillus cell characterised in containing a mutation in the ep_r gene, said epr gene encoding a protein comprising 
the amino acid sequence of Figure 6 or an evolutionary homoiogue thereof of other Bacillus species, having pro- 
tease activity, said mutation resulting in inhibition of the production by said cell of proteolytically active epj; gene 
product. 



2. A Bacillus cell according to Claim 1 , characterised in further containing a mutation in the RP-l-encoding gene, said 
RP-I encoding gene encoding a protein comprising the amino acid sequence of Figure 10 or an evolutionary homo- 
iogue thereof of other Bacillus species, having protease activity, said mutation resulting in inhibition of the produc- 
tion by said cell of proteolytically active RP-I. 

3. A Bacillus cell characterised in containing a mutation in the RP-l-encoding gene, said RP-I encoding gene encoding 
a protein comprising the amino acid sequence of Figure 10 or an evolutionary homoiogue thereof of other Bacillus 
species, having protease activity, said mutation resulting in inhibition of the production by said cell of proteolytically 
active RP-I. 



A Bacillus cell according to any preceding claim, characterised in further containing a mutation in the RP-II encoding 
gene, said RP-II encoding gene encoding a protein comprising the amino acid sequence of Figure 14 or an evo- 
lutionary homoiogue thereof of other Bacillus species, having protease activity, said mutation resulting in inhibition 
of the production by said cell of proteolytically active RP-II. 

A Bacillus cell characterised in containing a mutation in the RP-II -encoding gene, said RP-II encoding gene encod- 
ing a protein comprising the amino acid sequence of Figure 14 or an evolutionary homoiogue thereof of other 
Bacillus species, having protease activity, said mutation resulting in inhibition of the production by said cell of 
proteolytically active RP-II. 

A Bacillus cell according to any preceding claim, characterised in further containing mutations in the a^r and npr 
genes encoding extracellular proteases, said mutations resulting in inhibition of the production by said cell of said 
encoded proteolytic activities. 

A Bacillus cell according to any preceding Claim, further characterised in that the or each said mutation comprises 
a deletion within the coding region of the gene. 

A Bacillus cell according to any preceding claim, further containing a mutation in the isp-1 gene encoding an 
intracellular protease, said mutation resulting in inhibition of production by said cell of proteolytically active isp-l 
gene product. 

A Bacillus cell according to any of Claims 1 to 7, characterised in further containing a mutation which reduces said 
cell's capacity to produce one or more sporulat ion -dependent proteases. 

10. A Bacillus cell according to Claim 9, further characterised in that said sporulation-dependent protease mutation 
blocks sporulation at an early stage but does not eliminate the cell's ability to be transformed by purified DNA. 

45 11. A Bacillus cell according to Claim 10, further characterised in that said sporulation-dependent protease mutation 
is in the spoOA gene. 

12. A Bacillus cell according to any preceding claim, further characterised in being a Bacillus subtilis cell. 

50 13. A Bacillus cell according to any preceding claim, characterised in further comprising a gene encoding a heterolo- 
gous polypeptide. 

14. A cell according to Claim 13, further characterised in that said heterologous polypeptide is a hormone, vaccine, 
antiviral protein, antitumour protein, antibody or clotting protein. 

55 

15. A cell according to Claim 1 3, further characterised in that said heterologous polypeptide is a pesticide or enzyme. 

16. A method for producing a heterologous polypeptide in a Bacillus cell, characterised in comprising: introducing into 
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said cell a gene encoding said heterologous polypeptide, modified to be expressed in said cell, said Bacillus cell 
containing mutations in the apr and np_r genes, and further containing mutations in one or more of the genes 
encoding the Epr protease, RP-I, or RP-II, said Epr, RP-I and RP-II comprising the amino acid sequences set out 
in Figures 6, 10 and 14 respectively, or evolutionary homologues thereof of other Bacillus species, having protease 
activity, wherein said mutation results in the inhibition of the production by said cell of proteolytically active Epr 
protease, RP-I or RP-II. 



17. A method according to Claim 16, characterised in further containing a mutation in thejsjg-l gene encoding intra- 
cellular protease I, said mutation resulting in inhibition of production by said cell of proteolytically active jsp_-l gene 

io product. 

18. A method according to Claims 16 or 17, further characterised in that said heterologous polypeptide is normally 
unstable in a Bacillus cell. 

'5 19. A method according to any of Claims 16, 17 or 18, further characterised in that said cell is a Bacillus subtilis cell. 

20. A method according to any of Claims 16 to 1 9, further characterised in that said cell further contains a mutation 
which reduces said cell's capacity to produce one or more sporulation-dependent proteases, said mutation being 
in the spoOA gene. 

20 

21. A method according to any of Claims 16 to 20, further characterised in that said heterologous polypeptide is a 
hormone, vaccine, antiviral protein, antitumour protein, antibody, clotting protein, pesticide or enzyme. 

22. Purified DNA comprising a Bacillus epr gene, said gene encoding the amino acid sequence of Figure 6 or an 
£5 evolutionary homologue thereof of other Bacillus species, having protease activity 

23. Purified DNA comprising a Bacillus gene encoding RP-I, said gene encoding the amino acid sequence of Figure 
10 or an evolutionary homologue thereof of other Bacillus species, having protease activity. 

30 24. Purified DNA comprising a Bacillus gene encoding RP-II, said gene encoding the amino acid sequence of Figure 
14 or an evolutionary homologue thereof of other Bacillus species, having protease activity. 

25. A vector comprising a Bacillus epr gene, said gene encoding the amino acid sequence of Figure 6 or an evolutionary 
homologue thereof of other Bacillus species, having protease activity, and regulatory DNA operationally associated 

35 with said gene. 

26. A vector comprising a Bacillus gene encoding, RP-I said gene encoding the amino acid sequence of Figure 10 or 
an evolutionary homologue thereof of other Bacillus species, having protease activity, and regulatory DNA oper- 
ationally associated with said gene. 

40 

27. A vector comprising a Bacillus gene encoding, RP-II said gene encoding the amino acid sequence of Figure 14 
or an evolutionary homologue thereof of other Bacillus species, having protease activity, and regulatory DNA oper- 
ationally associated with said gene. 

45 28. A Bacillus cell transformed with a vector according to any of Claims 25, 26 and 27. 

29. Substantially pure Bacillus Epr protease comprising the amino acid sequence of Figure 6 or an evolutionary homo- 
logue thereof of other Bacillus species, having protease activity. 

50 30. Substantially pure Bacillus residual protease I (RP-I) comprising the amino acid sequence of Figure 10 or an 
evolutionary homologue thereof of other Bacillus species, having protease activity. 

31. Substantially pure Bacillus residual protease II (RP-II) comprising the amino acid sequence of Figure 14 or an 
evolutionary homologue thereof of other Bacillus species, having protease activity. 
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Patontansprucho 

1. Bacillus-Zelle, dadurch gekennzeichnet, daB sie eine Mutation im epr-Gen enthalt, wobei das epr-Gen fur ein 
Protein, das die Aminosauresequenz aus Figur 6 umfaBt Oder ein evolutionares Homologes davon von anderen 

5 Bacillus -Spezien mit Protease-Aktivitat kodiert, und die Mutation zur Inhibierung der Produktion des proteolytisch 

aktiven egr-Genprodukts durch diese Zelle fuhrt. 

2. Bacillus -Zelle nach Anspruch 1, dadurch gekennzeichnet, daB sie weiterhin eine Mutation im fur die RP-I kodie- 
renden Gen enthart, wobei das fur die RP-I kodierende Gen fur ein Protein, das die Aminosauresequenz von Figur 

10 10 umfaBt oder ein anderes evolutionares Homologes davon von anderen Bacillus -Spezies mit Protease-Aktivitat 

kodiert, und die Mutation zur Inhibierung der Produktion von proteolytisch aktivem RP-I durch diese Zelle fuhrt. 

3. Bacillus -Zelle. dadurch gekennzeichnet, daB sie eine Mutation im fur die RP-I kodierenden Gen enthalt, wobei 
das fur die RP-I kodierende Gen fur ein Protein, das die Aminosauresequenz von Figur 10 umfaBt oder ein evo- 

1$ lutionares Homologes davon von anderen Bacillus -Spezien mit Protease-Aktivitat kodiert, und die Mutation zur 

Inhibierung der Produktion von proteolytisch aktivem RP-I durch diese Zelle fuhrt. 

4. Bacillus -Zelle nach einem der vorangegangenen Anspruche, dadurch gekennzeichnet, daB sie weiterhin eine 
Mutation im fur die RP-11 kodierenden Gen enthatt, wobei das fur die RP-I I kodierende Gen fur ein Protein, das 

20 die Aminosauresequenz von Figur 14 umfaBt oder ein evolutionares Homologes davon von anderen Bacillus- 

Spezies mit Protease-Aktivitat kodiert, und die Mutation zur Inhibierung der Produktion von proteolytisch aktivem 
RP-ll durch diese Zelle fuhrt. 

5 - Bacillus -Zelle, dadurch gekennzeichnet, daB sie eine Mutation im fur die RP-ll kodierenden Gen enthalt, wobei 
25 das fur die RP-ll kodierende Gen fur ein Protein, das die Aminosauresequenz von Figur 14 umfaBt oder ein evo- 

lutionares Homologes davon von anderen Bacillus -Spezies mit Protease-Aktivitat kodiert, und die Mutation zur 
Inhibierung der Produktion von proteolytisch aktivem RP-ll durch diese Zelle fuhrt. 

6. Bacillus -Zelle nach einem der vorangegangenen Anspruche, dadurch gekennzeichnet, daB sie weiterhin Mutatio- 
30 nen in den fOr extrazellulare Proteasen kodierenden apr- und npr -Genen enthalt, und die Mutationen zur Inhibie- 
rung der Produktion dieser kodierten proteolytischen Akttvitaten durch diese Zelle fuhren. 

7. Bacillus -Zelle nach einem der vorangegangenen Anspruche, dadurch gekennzeichnet, daB auBerdem die Muta- 
tion oder jede dieser Mutationen eine Deletion innerhalb der kodierenden Region des Gens enthalt. 

35 

8. Bacillus -Zelle nach einem der vorangegangenen Anspruche, dadurch gekennzeichnet, daB sie weiterhin eine 
Mutation im fur eine intrazellulare Protease kodierenden isp-1 -Gen enthalt und die Mutation zur Inhibierung des 
proteolytisch aktiven isp-l -Genprodukts durch diese Zelle fuhrt. 

40 9 - Bacillus -Zelle nach einem der Anspruche 1 bis 7, dadurch gekennzeichnet, daB sie auBerdem eine Mutation ent- 
halt, die die Fahigkeit der Zelle, eine oder mehrere sporenbildungsabhangige Protease(n) zu produzieren, herab- 
setzt. 

10. Bacillus -Zelle nach Anspruch 9, dadurch gekennzeichnet, daB weiterhin die Mutation der sporenbildungsabhan- 
45 gigen Protease die Sporenbildung in einem fruhen Stadium blockiert, jedoch nicht die Fahigkeit der Zelle, mit 

gereinigter DNA transformiert zu werden, ausschaftet. 

11. Bacillus -Zelle nach Anspruch 10, dadurch gekennzeichnet, daB sich weiterhin die Mutation der sporenbildungs- 
abhangigen Protease im spo OA-Gen befindet. 



so 



12. Bacillus -Zelle nach einem der vorangegangenen Anspruche, dadurch gekennzeichnet, daB sie des weiteren eine 
Bacillus subtilus -Zelle ist. 

13. Bacillus -Zelle nach einem der vorangegangenen Anspruche, dadurch gekennzeichnet, daB sie weiterhin ein fOr 
ein heterologes Potypeptid kodierendes Gen enthalt. 

14. Zelle nach Anspruch 13, dadurch gekennzeichnet, daB das heterotoge Polypeptid ein Hormon, \&kzin, antivirales 
Protein, Antrtumorprotein, Antikorper oder Gerinnungsprotein darstellt. 
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15. Zelle nach Anspruch 13, dadurch gekennzeichnet, daB das heterologe Polypeptid ein Pestizid oder Enzym dar- 
stellt. 

16. Verfahren zur Herstetlung eines heterologen Polypeptids in einer Bacillus -Zelle. dadurch gekennzeichnet, daB 
s man in diese Zelle ein fur dieses heterologe Polypeptid kodierendes Gen, das fur die Expression in dieser Zelle 

modifiziert ist, einfuhrt, wobei die Bacillus -Zelle in den apr - und npr-Genen Mutationen enthalt und auBerdem 
Mutationen in einem oder mehreren Gen(en), die (das) fur die epr-Protease. RP-I oder RP-II kodiert bzw. kodieren, 
enthalten sind, und die Epr, RP-I und RP-II die Aminosauresequenzen aus den Figuren 6, 10 bzw. 14 oder evo- 
lutionare Homologe davon von anderen Bacillus -spezies mit Protease-Aktivitat umfassen, und die Mutation zur 
10 Inhibierung der Produktion von proteolytisch aktiver Epr-Protease, RP-I oder RP-ll durch diese Zelle fuhrt 

17. Verfahren nach Anspruch 16, dadurch gekennzeichnet, daB weiterhin eine Mutation im fur eine intrazellulare Pro- 
tease-l kodierenden jsj>1 Gen vorhanden ist, und die Mutation zur Inhibierung der Produktion von proteolytisch 
aktivem isp-l Genprodukt durch diese Zelle fuhrt. 

75 

18. Verfahren nach Anspruch 16 oder 17, dadurch gekennzeichnet, daB weiterhin das heterologe Polypeptid in einer 
Bacillus -Zelle normalerweise instabil ist. 

19. Verfahren nach einem der Anspruche 16, 17 oder 1B, dadurch gekennzeichnet, daB die Zelle eine Bacillus subtilis - 
20 Zelle ist. 

20. Verfahren nach einem der Anspruche 16 bis 19, dadurch gekennzeichnet, daB die Zelle auBerdem eine Mutation 
enthalt, die das Vermogen der Zelle, eine oder mehrere sporenbildungsabhangige Protease(n) zu produzieren, 
herabsetzt, wobei sich die Mutation irn spoOA-Gen befindet. 

25 

21. Verfahren nach einem der Anspruche 16 bis 20, dadurch gekennzeichnet, daB das heterologe Polypeptid ein 
Hormon, Vakzin, antivirales Protein, Antitumorprotein, Antikorper, Gerinnungsprotein, Pestizid oder Enzym dar- 
stellt. 

30 22. Gereinigte DNA mit einem epr-Gen aus Bacillus, worin das Gen fur die Aminosauresequenz von Figur 6 oder ein 
evolutionares Homologes davon von anderen Bacillus-Spezien mit Protease-Aktivitat kodiert. 

23. Gereinigte DNA mit einem fur die RP-I kodierenden Bacillus -Gen, worin dieses Gen fur die Aminosauresequenz 
von Figur 1 0 oder ein evolutionares Homologes davon von anderen Bacillus- Spezies mit Protease-Aktivitat kodiert. 

35 

24. Gereinigte DNA mit einem fur die RP-II kodierenden Bacillus -Gen, worin das Gen fur die Aminosauresequenz von 
Figur 14 oder ein evolutionares Homologes davon von anderen Bacillus-Spezies mit Protease-Aktivitat kodiert. 

25. Vektor mit einem ep_r-Gen aus Bacillus, worin das Gen fur die Aminosauresequenz von Figur 6 oder ein evolutio- 
40 nares Homologes davon von anderen Bacillus -Spezies mit Protease-Aktivitat kodiert und regulatorischen DNA- 

Elementen, die funktionell mit diesem Gen verbunden sind. 

26. Vektor mit einem fur die RP-I kodierenden Bacillus -Gen, worin das Gen fur die Aminosauresequenz von Figur 10 
oder ein evolutionares Homologes davon von anderen Bacillus -Spezies mit Protease-Aktivitat kodiert und anderen 
regulatorischen DNA-Elementen, die funktionell mit diesem Gen verbunden sind. 

27. Vektor mit einem fOr die RP-II kodierenden Bacillus -Gen, worin das Gen fOr die Aminosauresequenz von Figur 14 
oder ein evolutionares Homologes davon von anderen Bacillus -Spezies mit Protease-Aktivitat kodiert und anderen 
regulatorischen DNA-Elementen, die funktionell mit diesem Gen verbunden sind. 

so 

28. Bacillus -Zelle. die mit einem Vektor nach einem der Anspruche 25, 26 und 27 transformiert ist. 

29. Im wesentlichen reine Epr-Protease aus Bacillus , die die Aminosauresequenz von Figur 6 oder ein evolutionares 
Homologes davon von anderen Bacillus -Spezies mit Protease-Aktivitat umfaBt. 

55 

30. Im wesentlichen reine Restprotease I (RP-I) aus Bacillus , die die Aminosauresauresequenz von Figur 10 oder ein 
evolutionares Homologes davon von anderen Bacillus -Spezies mit Protease-Aktivitat umfaBt. 
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31. Im wesentlichen reine Restprotease tl (RP-ll) aus Bacillus , die die Aminosauresequenz von Figur 14 oder ein 
evolutionares Homologes davon von anderen Bacillus -Spezies mit Protease-Aktivitat umfaGt. 



s Revendications 

1. Cellule de Bacillus caracteris6e en ce qu'elle contient une mutation dans le gene ep_r, ledit gene ep_r codant une 
proteine comprenant la sequence d'acides amines de la figure 6 ou un homologue 6volutionnaire de celle-ci 
d'autres especes de Bacillus , ayant une activity de protease, ladite mutation entraTnant une inhibition de la pro- 

io duction par ladite cellule d'un produit du gene ep_r actif du point de vue proteolytique. 

2. Cellule de Bacillus selon la revendication 1 , caracterisde en ce qu'elle contient en outre une mutation dans le gene 
codant la RP-I, ledit gene codant la RP-I codant une proteine comprenant la sequence d'acides amin6s de la figure 
10 ou un homologue 6votutionnaire de celle-ci d'autres especes de Bacillus, ayant une activity de protease, ladite 

1 $ mutation entraTnant I'inhibition de la production par ladite cellule d'une RP-I active du point de vue proteolytique. 

3. Cellule de Bacillus caract6ris6e en ce qu'elle contient une mutation dans le gene codant la RP-I, ledit gene codant 
la RP-I codant une proteine comprenant la sdquence d'acides amines de la figure 10 ou un homologue 6volution- 
naire de celle-ci d'autres especes de Bacillus , ayant une activite de protease, ladite mutation entraTnant I'inhibition 

20 de la production par ladite cellule d'une RP-I active du point de vue proteolytique. 

4. Cellule de Bacillus selon Tune quelconque des revendications precddentes, caracteris6e en ce qu'elle contient en 
outre une mutation dans le gene codant la RP-ll, ledit gene codant la RP-ll codant une proline comprenant la 
sequence d'acides amines de la figure 1 4 ou un homologue eVolutionnaire de celle-ci d'autres especes de Bacillus , 

25 ayant une activite" de protease, ladite mutation entraTnant I'inhibition de la production par ladite cellule d'une RP- 

ll active du point de vue proteolytique. 

5. Cellule de Bacillus caracterisSe en ce qu'elle contient une mutation dans le gene codant la RP-ll, ledit gene codant 
la RP-ll codant une proline comprenant la sequence d'acides amines de la figure 14 ou un homologue eVolution- 

30 naire de celle-ci d'autre especes de Bacillus , ayant une activity de protease, ladite mutation entraTnant I'inhibition 

de la production par ladite cellule d'une RP-ll active du point de vue proteolytique. 

6. Cellule de Bacillus selon I'une quelconque des revendications pr6c6dentes, caracteris6e en ce qu'elle contient en 
outre des mutations dans fes genes ap_r et npr codant des proteases extracetlulaires, lesdites mutations entraTnant 

35 I'inhibition de la production par ladite cellule desdites activites proteolytiques cod6es. 

7. Cellule de Bacillus selon I'une quelconque des revendications precddentes, caracteris6e en outre en ce que ladite 
mutation ou chacune desdites mutations comprend une d6l6tion dans la region codante du gene. 

40 8. Cellule de Bacillus selon I'une quelconque des revendications ptecddentes, comprenant en outre une mutation 
dans le gene isp-1 codant une protease intracellulaire, ladite mutation entraTnant I'inhibition de ia production par 
ladite cellule d'un produit du gene isp-l actif du point de vue proteolytique. 

9. Cellule de Bacillus selon I'une quelconque des revendications 1 a 7, caract6ris6e en ce qu'elle contient en outre 
45 une mutation qui tedurt la capacite de ladite cellule a produire une ou plusieurs proteases d6pendantes de la 

sporulation. 

10. Cellule de Bacillus selon la revendication 9, caracterisde en outre en ce que ladite mutation des proteases d6pen- 
dantes de la sporulation bloque la sporulation a un stade prScoce mais n r 6limine pas I'aptitude de la cellule a Stre 

50 transformed par un ADN purifie\ 

1 1 . Cellule de Bacillus selon la revendication 1 0, caracterisSe en outre en ce que ladite mutation de proteases depen- 
dant es de la sporulation est dans le gene spo OA. 

55 12. Cellule de Bacillus selon I'une quelconque des revendications pr6c6dentes, caracteris6e en outre en ce qu'il s'agit 
d'une cellule de Bacillus subtilis . 

13. Cellule de Bacillus selon I'une quelconque des revendications pr6c6dentes, caracterisde en ce qu'elle comprend 
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en outre un gene codant un polypeptide necrologue. 

14. Cellule selon la revendication 13, caracteris6e en outre en ce que ledit polypeptide necrologue est une hormone, 
un vaccin, une proteine antivirale, une proteine antitumorale, un anticorps ou une proteine de coagulation. 

5 

15. Cellule selon la revendication 13, caracteris6e en outre en ce que ledit polypeptide necrologue est un pesticide 
ou une enzyme. 

16. Proc6d6 de production d'un polypeptide rteterologue dans une cellule de Bacillus , caracteris6 en ce qu'il 
io comprend : 1'introduction dans ladite cellule d'un gene codant ledit polypeptide rteterologue, modifi6 pour Stre 

exprinrte dans ladite cellule, ladite cellule de Bacillus contenant des mutations dans les genes apr et npr , et con- 
tenant en outre des mutations dans un ou plusieurs des genes codant la protease Epr, RP-I ou RP-II, lesdites Epr, 
RP-1 et RP-II comprenant les sequences d'acides amines representees sur les figures 6, 10 et 14 respectivement, 
ou des homologues evolutionnaires de celles-ci d'autres especes de Bacillus ayant une activity de protease, ladite 
*s mutation entralnant ('inhibition de la production par ladite cellule de protease Epr, de RP-I ou de RP-II active du 

point de vue proteolytique. 

17. Proc6de selon la revendication 16, caracterise en ce qu'il comprend en outre une mutation dans le gene isp-1 
codant la protease intracellular I, ladite mutation entrainant inhibition de la production par ladite cellule d'un 

20 produit du gene isp-l actif du point de vue proteolytique. 

18. Proc6d6 selon les revendications 16 ou 17, caracteris6 en outre en ce que ledit polypeptide rteterologue est nor- 
malement instable dans une cellule de Bacillus . 

25 19. Prcc6d6 selon Tune quelconque des revendications 16, 17 ou 18, caracterise en outre en ce que ladite cellule est 
une cellule de Bacillus subtillis . 

20. Proc6d6 selon Tune quelconque des revendications 16 a 19, caracterise" en outre en ce que ladite cellule contient 
en outre une mutation qui r6duit la capacite de ladite cellule a produire une ou plusieurs proteases dependantes 

30 de la sporulation, ladite mutation etant dans le gene spo OA. 

21. Proc6d6 selon I'une quelconque des revendications 16 a 20, caracteris6 en outre en ce que ledit polypeptide 
rteterologue est une hormone, un vaccin, une proline antivirale, une proteine antitumorale, un anticorps, une 
proteine de coagulation, un pesticide ou une enzyme. 

35 

22. ADN purifid comprenant un gene epr de Bacillus , ledit gene codant la sequence d'acides amines de la figure 6 ou 
un homologue dvolutionnaire de celle-ci d'autres especes de Bacillus , ayant une activite de protease. 

23. ADN purifi6 comprenant un gene de Bacillus codant RP-I, ledit gene codant la sequence d'acides amines de la 
40 figure 10 ou un homologue 6volutionnaire de celle-ci d'autres especes de Bacillus , ayant une activity de protease. 

24. ADN purifi6 comprenant un gene de Bacillus codant RP-II, ledit gene codant la sequence d'acides amin6s de la 
figure 14 ou un homologue 6volutionnaire de celle-ci d'autres especes de Bacillus, ayant une activite de protease. 

*5 25. Vecteur comprenant un gene ep_r de Bacillus ledit gene codant la sequence d'acides amrn6s de la figure 6 ou un 
homologue evolutionnaire de celle-ci d'autres especes de Bacillus , ayant une activite de protease, et un ADN 
regulateur associ6 de maniere active audit gene. 

26. Vecteur comprenant un gene de Bacillus codant RP-I, ledit gene codant la sequence d'acides amines de la figure 
50 10 ou un homologue evolutionnaire de celle-ci d'autres especes de Bacillus , ayant une activite de protease, et un 

ADN regulateur associe de maniere active audit gene. 

27. Vecteur comprenant un gene de Bacillus codant RP-II, ledit gene codant la sequence d'acides amines de la figure 
14 ou un homologue evolutionnaire de celle-ci d'autres especes de Bacillus , ayant une activite de protease, et un 

55 ADN tegulateur assocte de maniere active audit gene. 

28. Cellule de Bacillus transformee avec un vecteur selon Tune quelconque des revendications 25, 26 et 27. 
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29. Protease Epr de Bacillus sensiblement pure comprenant la sequence d'acides amines de la figure 6 ou un homo- 
logue evolutionnaire de celle-ci d'autres especes de Bacillus , ayant une activity de protease. 

30. Protease r6siduelle I (RP-I) de Bacillus sensiblement pure comprenant la sequence d'acides amines de la figure 
5 10 ou un homologue evolutionnaire de celle-ci d'autres especes de Bacillus , ayant une activite de protease. 

31. Proteose r6siduelle II (RP-II) de Bacillus sensiblement pure comprenant la sequence d'acides amines de la figure 
14 ou un homologue Evolutionnaire de celle-ci d'autres especes de Bacillus , ayant une activity de protease. 

10 
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FIG, 1 
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FIG. 3 
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FIG. 6=1 

-446 ATCCGAGCTTATC6GCCCACTCGTTCCCAAACACACTCGCCATGAAATCAGCATACCCC 
GGAATCGGCAAGCTCGTTAAAATCAAGAAGACAGACCCGATAATAATCAGCGGCATGGT 
CAGGATAATCCGTCACGCAAAGCGCTGAGATGCCGCTGCCCGGCAATTTTCCCGGCGAC 
AGGCATTATTTTTTCCTCCATCACCCGAGTGAATGTGCTCATCTTAAAAACCCCCTTTT 
CTCATTGCTTTGTGAACAACCTCCGCAATGTTTTCTTTATCTTATTTTGAAAACGCTTA 
CAAATTCATTTGGAAAATTTCCTCTTCATGCGGAAAAAATCTGCATTTTGCTAAACAAC 
CCTGCCCATGAAAAATTTTTTCCTTCTTACTATTAATCTCTCTTTTTTTCTCCGATATA 
TATATCAAACATCATAGA&AAfiMIGMTC 
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FIG. 6=2 

406 ATC AAA ATT GCC 6TC ATT GAC AGC GGG ATC TCC CCC CAC GAT GAC 
lie lys lie ala val lie asp ser gly 1le sen pro Ms asp asp 

451 CTG TCG ATT GCC GGC GGG TAT TCA GCT GTC AGT TAT ACC TCT TCT 
leu ser lie ala gly gly tyr ser ala val ser tyr thr ser ser 

496 TAC AAA GAT GAT AAC GGC CAC GGA ACA CAT GTC GCA GGG ATT ATC 
tyr lys asp asp asn gly his gly thr his val ala gly ile lie 

541 GGA GCC AAG CAT AAC GGC TAC GGA ATT GAC GGC ATC GCA CCG GAA 
gly ala lys his asn gly tyr gly lie asp gly lie ala pro glu 

586 GCA CAA ATA TAC GCG GTT AAA 6CG CTT GAT CAG AAC GGC TCG GGG 
ala gin lie tyr ala val lys ala leu asp gin asn gly ser gly 

631 GAT CTT CAA AGT CTT CTC CAA GGA ATT GAC TGG TCG ATC GCA AAC 
asp leu gin ser leu leu gin gly ile asp trp ser 1le ala asn 

676 AGG ATG GAC ATC GTC AAT ATG AGC CTT GGC ACG ACG TCA GAC AGC 
arg met asp lie val asn met ser leu gly thr thr ser asp ser 

721 AAA ATC CTT CAT GAC GCC GTG AAC AAA GCA TAT GAA CAA GGT GTT 
lys lie leu his asp ala val asn lys ala tyr glu gin gly val 

766 CTG CTT GTT GCC GCA AGC GGT AAC GAC GGA AAC GGC AAG CCA GTG 
leu leu val ala ala ser gly asn asp gly asn gly lys pro val 

811 AAT TAT CCG GCG GCA TAC AGC AGT GTC GTT GCG GTT TCA GCA ACA 
asn tyr pro ala ala tyr ser ser val val ala val ser ala thr 

856 AAC GAA AAG AAT CAG CTT GCC TCC TTT TCA ACA ACT GGA GAT GAA 
asn glu lys asn gin leu ala ser phe ser thr thr gly asp glu 

901 GTT GAA TTT TCA 6CA CCG GGG ACA AAC ATC ACA AGC ACT TAC TTA 
val glu phe ser ala pro gly thr asn-lle ihr ser thr tyr leu 
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FIGo 6=3 
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FIGo 6=4 

I486 GTT GCG AAA GCG GAA AAA AGC AAA AAG AAA ACA GAT GTG GAC AGC 

val ala lys ala glu lys ser lys lys lys thr asp val asp sen 

1531 GCA CAA TCA GCA ATT GGC AAG CTG CCT GCA AGT TCA GAA AAA ACG 

ala gin ser ala 11e gly lys leu pro ala ser ser glu hs thr 

P<tt. I 

1576 TCC CTG CAG AAA CGC CTT AAC AAA GTG AAl AGC ACC AAT TTG AAG 

ser leu gin lys arg leu asn lys val ly; ser thr asn leu lys 



1621 ACG 
thr 



GCA CAG CAA TCC GTA TCT GCG GCT GAA AAG AAA TCA ACT GAT 
ala gin gin ser val ser ala ala glu lys lys ser thr asp 



1666 GCA AAT GCG GCA AAA GCA CAA TCA GCC GTC AAT CAG CTT CAA GCA 
ala asn ala ala lys ala gin ser ala val asn gin leu gin ala 

1711 GGC AAG GAC AAA ACG GCA TTG CAA AAA CGG TTA GAC AAA GTG AAk 
gly lys asp lys thr ala leu gin lys arg leu asp lys val lys 

1756 AAA AAG GTG GCG GCG GCT GAA GCA AAA AAA GTG GAA AcF GCA AAG 
lys lys val ala ala ala glu ala lys lys val glu thr ala lys 

1801 GCA AAA GTG AAG AAA GCG GAA AAA GAC AAA ACA AAG AAA TCA AAG 
ala lys val lys lys ala glu lys asp lys thr lys lys ser lys 
Pstt 

1846 ACA TCC GCT CAG TCT GCA GTG AAT CAA TTA AAA GCA TCC AAT GAA 
thr ser ala gin ser ala val asn gin leu lys ala ser asn glu 

1891 AAA ACA AAG CTG CAA AAA CGG CTG AAC GCC GTC fA CCG AAA 
lys thr lys leu gin lys arg leu asn ala val ly) pro lys 

1936 AAG TAA CCAAAAACCTTTAAGAHI£CATTCCAA STCTTAAAS6TTTTT TT 

lys ooo 

1994 CATTCTAAGA 
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The underlined portion is the approximate 
location of the RP-I gene on the PstI 
fragment. 
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The shaded box represents the regtaa to which the 
RP-H "guess-mars* hybridized. 
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FIGo 1(0=1 

-199 TAACMACA6ATAAnAeACKATnAnnil6A6ATTTTAieAnTeAtATAlAl 
GGAAAT76AAC6AeAC6AAACtAeMTATgT6TAAn€A6ATT6TCIAeASTTAAIA 
TACA6e6AT6TTeT|A°AAACeAT?€ATTAnAAAA66A666AeiACAeTnTITTA 

AAAAfiCA7GTT6AAAAA66666AY6AM 

<\ ATO AG6 AAA AAA ACI AAA AAC AGA 6TG A76 AGS T6T GT? TU AST 
mi drg )y$ ly$ thr )y$ d$ft dtp ha Uq wr $QP vol Uu W 

46 ACA 5TT ITC ATC AST TU €T6 6T§ T7T 6CG GSA 6CA 166 III GGA 
ittr nl'n\ \U sq? $8? ley ley pf§ oU old sly oh 

91 AGC AST AAA- GYC ACC TGA ggl T6Y ITT AAA AAI SA6 6TT 6M 1ST 
$er set ly$ n) the pr§ ser vol lys Jys ffly la» §fa see 

136 GCG GAA TCC ATT CM AAg AAI ATT T6I AGT T6A TTA MG AAA AGC 
e/<j sly ser fl<? ejfl asa Pys Jig sef see sgp flfly ?y« lys sqp 

181 ITT AAA AAG AAA GAA AAA AGS ACT TTT 6T6 ATT AM TIT AMU 
)/« J/i ly$ §U )y$ m ®r $e ha llo lys $U lys asp 

Hi CTO GGT AAC GGA IAA AM ggg MA 666 6CT §1? AAA AAA SeS 
ley a la ai« pet -flu lys old old lys did old vol lys lys old 

11) AAA TCI AAI AAS CTg TO 666 66T Ml A66 6M TAT 6AA M6 €67 
lys ser lys lys hv see did did lys fcftf 3 eto iw fjfo 0f0 

316 TCT G6T ITT 6T6 T6A T6T TO AAA 6T6 A6A 666 HAT T66 &Ai 
sw ala vd$ vol see sqit U& lys m gfip did osp q1v m gla 

361 CAA GAT ITC 6TA AAA TA6 HI MS A£g @A6 AM SAT AM 66A Ml 
gin asp vaJ ley lys $pf Sew SAp @l® lys lys gly esm 

40S GCA GAC CAA ATT GAT m TAT TAT ITS 616 AA6 SIS ATT GGT 6TT 
9ld d$p §la ilg his sen 3 fye wl troJ o$® ffly tto dU val 

451 CAT GCG TGA AM 6A6 ITT AT6 §M AM m ST6 m TTT €66 6M 
Ms did «f lys 0l» va? o§g >a? traJ ^Ifl ^fia pre 0?^ 
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FIG* 10=2 
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FIGo 10=4 



lfft GAB AM 6TA A6? SAA §CT TAT IAA 1ST ISC ASC CTA CCA CTG ACT 
§ly lys vdi W lw did typ 0?0 0fr 0ty ief ley pre ley t/ir 

im TT6 A6A deT GAA SAG AAT ITS A§T ITS ACA T€T STA AAG CTG TCC 
Jet* Sfip ajs asp as« roJ sep pafl thr $qp nl ly$ ley $§r 

mi m m m m m m m T66 aca iaa ata acs set am cga 

typ l-ys Uts dsp §U sly gits Up thp §h Ite thp did )y$ itg 

im Ate aimit sat sat sta aaa ssa m tat m sca m atc cca 

llo sqp sly d$p Ms 1w lys sly the typ §U ah gh ite pro 

1801 6AT ATA AAA SSA AST AAA CTA A6£ TAT AAI Til ATS ATT CAC GAT 
dzp llo Hy$ gly top Igb sqp typ Sys fcpp ®e£ Uq Ms isp 

114$ tit lie giT cat iie in Teg tct §ac ita ta€ gat sta aca gtg 

pite fjly $y Mi vdi vol $g? sqp dsp vol typ mp nl thr vsl 

mi aaa &a Aie ak asi see iia tat aas sai sac ttt saa act gca 

ly$ m sqp Mq ZhP did sly %yp ly§ $1® d§p phe glu thr sh 

i&g eee sse lie tii-itt ges Aie iia aca aat aat aac tii saa tgg 

pp§ sly top vol old sqp ®ly thp a&a dsn d$n %rp glv trp 

mi m smr m tsa ast m ^ aat asa sca §eA Tee ssa gaa aaa 

Sly vol pp© m sap sly $f§ ossa thp did old $er gly glv lys 

m$> ita ta? mk m mt tts m m at? ats ea act cai caa aca 

vol Sfif asa tfip f?e @eg ^p@ ^p ^Ifl tiir 

2071 TiA AgeWinATieSTeeTATIAAAieAeeTIATTeAllAAiTeTiTTCCTTCAATT 

m 

TmiCTIieAeAATTTAiAigATiATnnrTSAnAeiSeTAeiTTTTTITTCTTCCGGA 
AiSTSAAAAIAATTISIAiSAIgeT^TgTOTAeilAAgeTeAAieTIGACGSACGAG 
AAI6TTATe§geTTATAA6 
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